Foundations of Artificial Intelligence
What AI actually is, where it came from, why symbolic AI failed, the philosophical debates that matter today, the rational agent framework, competing paradigms, and the state of the field.
AI is not a single thing. It is a collection of techniques for building systems that optimise toward a goal using data. The magic disappears when you see the math.
Biological Intelligence
- Embodied โ tied to a body with survival pressures
- Developed through evolution over millions of years
- Energy-efficient: 20 watts powers the human brain
- Generalises from very few examples (few-shot by default)
- Handles open-ended, ambiguous, novel situations naturally
Machine Intelligence
- Disembodied โ no physical survival pressure
- Built through training on human-generated data
- Energy-hungry: GPT-4 training cost ~$100M in compute
- Requires millionsโbillions of examples to learn patterns
- Brittle outside training distribution; exceptional at defined tasks
- Ability to learn from experience
- Ability to solve novel problems
- Ability to understand and generate language
- Ability to reason under uncertainty
There is no single universally agreed definition of AI. Here are the four most cited, each emphasising a different aspect:
"The science and engineering of making intelligent machines, especially intelligent computer programs."
"The study of agents that receive percepts from the environment and perform actions." The textbook definition.
"Building systems that can perceive, reason, learn, and act โ performing tasks that traditionally required human intelligence."
"An engineered system that generates outputs such as content, forecasts, recommendations, or decisions for a given set of human-defined objectives."
The common thread across all four: AI systems are designed to optimise for a goal. Not magic โ sophisticated optimisation machines trained on data. Whether the goal is to win a chess game, classify an image, or predict the next word in a sentence, the mechanism is the same: minimise a loss, maximise a reward.
"AI is whatever humans haven't figured out how to do yet. Once we have an algorithm for it, we stop calling it AI."
Tesler, L. โ coined at Xerox PARC, ~1970s | Larry Tesler's Theorem (the AI Effect)| Term | Formal Definition | Emerged | Canonical Example |
|---|---|---|---|
| Artificial Intelligence | Any technique enabling machines to mimic intelligent behaviour | 1956 โ Dartmouth Conference | Chess programs, expert systems |
| Machine Learning | Algorithms that learn from data without explicit programming | 1959 โ Arthur Samuel | Decision trees, XGBoost, SVMs |
| Deep Learning | ML using multi-layer neural networks learning hierarchical representations | ~2006 / mainstream 2012 | GPT-4, ResNet, AlphaFold, DALL-E |
| Data Science | Extracting insight from data using statistics, ML, and visualisation | 2000s | Business analytics, dashboards |
All AI systems โ from the simplest spam filter to GPT-4 โ can be understood through four fundamental capabilities. These aren't arbitrary categories: they map directly to the architectural components you'll encounter in every system throughout this curriculum.
Perceive
Receive input from the environment. Sensors, cameras, microphones, text APIs, database queries. The AI's window onto the world begins here.
Example: GPT-4 Vision reading an image; a self-driving car's LiDAR scanning the road ahead.
Reason
Analyse, compare, infer, and plan. Apply logic, pattern matching, or learned heuristics to understand input and determine what to do next.
Example: A classifier deciding "this email is spam"; an LLM using chain-of-thought to plan multi-step solutions.
Learn
Update the internal model based on feedback, new data, or experience. This is what separates machine learning from traditional rule-based software.
Example: A model's weights adjusting during gradient descent; a recommender system adapting to user feedback.
Act
Generate outputs: text, decisions, control signals, code, images. Take actions in the world โ and potentially change the environment for the next perception cycle.
Example: GPT generating a response; a robot arm moving to pick up an object.
Perceive is the entry point. In large language models, the perception layer is the tokeniser and embedding lookup โ raw text transformed into high-dimensional vectors the network can process. In computer vision systems, it's the image pipeline: resize, normalise, encode. In robotics, it's sensor fusion from cameras, accelerometers, and LiDAR.
Reason sits at the heart of what makes AI feel intelligent. In a transformer model, attention mechanisms allow every token to reason about every other token in the context window. In a decision tree, reasoning is the sequence of feature comparisons that route an input to a leaf node prediction.
Learn is the mechanism that makes modern AI powerful. Traditional software is programmed with explicit rules. Learning systems instead infer rules from examples. A loss function quantifies error, and an optimiser updates parameters to reduce it โ repeated millions of times across millions of examples.
Act closes the loop. For a language model, acting means sampling the next token from a probability distribution. For a recommendation engine, surfacing the top-k items. For a robotic system, issuing motor commands. The action taken modifies the environment, producing new perception data.
A self-driving car PERCEIVES the road, REASONS about obstacles, has LEARNED from millions of miles of training data, and ACTS by steering. ChatGPT PERCEIVES your text, REASONS using learned patterns, and ACTS by generating tokens one at a time.
These terms are often used interchangeably in the press, but they have precise relationships โ each is a subset of the one above. Consider ChatGPT: it is simultaneously correct to call it AI, ML, and Deep Learning โ each statement is true and increasingly specific.
However, the reverse implication does not hold. Not all AI is ML โ expert systems encode human knowledge directly as rules, without learning from data. Not all AI is deep learning โ a random forest or naive Bayes classifier are ML but not deep learning. Data Science is a separate discipline focused on extracting business insight via statistics, visualisation, and ML โ it overlaps AI but is not a true subset.
- Systems that learn rules from data without explicit programming
- Improving in performance with more and better data
- Generalising learned patterns to new, unseen examples
- The dominant paradigm powering modern AI products
- Just statistics (it adds architectural inductive biases statistics lacks)
- Always neural networks (trees, SVMs, and ensembles are ML too)
- Magic or sentient โ it is function approximation
- Guaranteed to work on any problem without careful engineering
Common misuse: "We use AI" usually means "we use ML" and often specifically means "we use a trained model." Precision matters โ especially when evaluating vendor claims. Ask: Is it rule-based? Trained? On what data? Measured by what metric?
AI systems can be classified along two axes: scope (what range of tasks?) and capability level (how does it compare to human intelligence?).
ANI โ Narrow AI
Excels at one specific task. Cannot generalise outside its training domain. All AI today is ANI โ GPT-4 is brilliant at text but cannot physically navigate a room. AlphaGo beat world champions at Go but cannot play chess.
AGI โ General AI
Can perform any intellectual task a human can. Transfers knowledge across completely different domains without retraining. Not yet achieved. Timeline: 5โ20+ years (highly debated). OpenAI's stated mission.
ASI โ Superintelligence
Surpasses human intelligence in every domain. Could recursively self-improve. Purely theoretical. Motivates AI safety and alignment research today โ the "singularity" discussed by Bostrom, Yudkowsky, Tegmark.
- Fluent, coherent text generation at scale
- Image classification (often superhuman accuracy)
- Code writing, debugging, and refactoring
- High-quality language translation
- Strategic game playing (Go, Chess, StarCraft II)
- Protein structure prediction (AlphaFold)
- Reliable reasoning about novel physical situations
- Maintaining consistent long-term plans over time
- Transferring skills across domains without retraining
- Acting reliably in open-ended, unpredictable environments
- Consistently avoiding hallucination and factual error
Public discourse around AI is saturated with misconceptions โ drawn from science fiction, corporate marketing, and media sensationalism. Before going deeper into mechanisms, it is worth explicitly naming what AI is not.
Not Like the Movies
HAL 9000, Skynet, and Samantha from Her are not realistic portrayals. Real AI systems are narrow, brittle outside their training domain, and have no goals of their own โ they optimise whatever objective they're given.
Not Magic โ It's Statistics
A language model predicts the most likely next token given all previous tokens. An image classifier outputs a probability distribution over labels. Every "intelligent" output is the result of mathematical optimisation on data.
Not Sentient or Conscious
Current AI systems have no subjective experience, emotions, or desires. They process inputs and produce outputs. Whether future AI could be conscious is a genuine philosophical question โ today's systems are not.
Not Infallible
AI systems fail in unexpected ways. They hallucinate facts, exhibit bias inherited from training data, and can be fooled by adversarial examples. They are tools with specific, well-characterised failure modes โ not oracles.
The goal of this documentation is to replace awe with understanding. Every impressive AI output is traceable to training data, an objective function, and an optimisation algorithm. The mystery evaporates when you see the mechanism.
AI has followed a pattern of hype โ disillusionment โ breakthrough โ twice. Understanding why it failed helps you understand why deep learning succeeded where everything before it failed.
AI history is a story of alternating euphoria and collapse. Understanding why each era gave way to the next โ not just when โ is the best inoculation against misreading hype today. The pattern: overpromise โ underfund โ winter โ unexpected breakthrough โ repeat.
Long before the first computer was built, mathematicians and philosophers dreamed of mechanising thought. Gottfried Wilhelm Leibniz (1646โ1716) envisioned a Characteristica Universalis โ a universal symbolic language that could encode all human knowledge โ and a calculus ratiocinator that could reason over it mechanically. This dream of reducing reasoning to symbol manipulation would resurface, unchanged in spirit, at the Dartmouth Conference three centuries later.
George Boole (1854) formalised logic as algebra โ reducing true/false statements to 0s and 1s and defining the operations AND, OR, NOT. This was the mathematical bedrock everything else was built on. Ada Lovelace (1840s), writing annotations on Babbage's Analytical Engine, described the first published algorithm intended for a computing machine โ and crucially, articulated both its power and its limits: the machine can only do what it is told. Claude Shannon (1938) showed that Boolean algebra could be implemented in electronic circuits, connecting Boole's logic to physical hardware. And finally McCulloch and Pitts (1943) proposed the first mathematical model of a neuron โ a binary threshold unit that fires when its weighted inputs exceed a threshold. Every modern neural network traces its ancestry directly to that 1943 paper.
These five contributions โ symbolic reasoning, Boolean logic, algorithms, logic circuits, and artificial neurons โ were the intellectual raw materials that the Dartmouth group assembled into a new field in 1956.
In 1950 Alan Turing published "Computing Machinery and Intelligence" โ opening with the question "Can machines think?" He proposed the Imitation Game as a pragmatic operational test: if a machine can convince a human judge, communicating only via text, that it is human, we have sufficient practical grounds to attribute intelligence to it. Turing was careful not to define intelligence โ he proposed the test precisely to sidestep that philosophical quagmire. He also described the "child machine" concept: rather than programming adult intelligence directly, build a machine that learns. This was the first clear articulation of what we now call machine learning.
Six years later, in the summer of 1956, John McCarthy, Marvin Minsky, Claude Shannon, and Herbert Simon convened at Dartmouth College and officially founded the field of Artificial Intelligence. Their proposal was breathtakingly optimistic: "Every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it." Minsky later claimed that "within a generation the problem of creating AI will substantially be solved." That generation passed without resolution. The Dartmouth group were brilliant researchers who massively underestimated three things: the importance of data, the role of perception and embodiment, and the sheer computational depth required.
- AI as a formal, dedicated research field
- Symbolic manipulation is a valid form of reasoning
- Computers can exhibit intelligent behaviour
- The field needs dedicated researchers and funding
- Intelligence timeline vastly underestimated
- Ignored the critical importance of data
- Ignored embodiment, perception, and grounding
- Assumed logic alone equals intelligence
The decade following Dartmouth was genuinely remarkable. Early AI programs demonstrated things no one had seen before: a machine proving mathematical theorems, a program that could hold a conversation, a system that learned to play checkers by playing against itself. In 1959 Arthur Samuel's checkers program โ which improved by self-play โ gave the field a new term: machine learning. The optimism was not irrational โ the demos were real. The problem was that toy domains don't scale.
The AI winters are the most important episodes in AI history for developing intuition about the current moment. They were not caused by bad science. They were caused by a structural gap: researchers promised capabilities that required data and compute that didn't yet exist, and funders cut support when those promises weren't kept on schedule.
Why AI Failed (Technical)
- Combinatorial explosion: symbolic search cannot scale to real-world problem sizes
- No data: rule-based systems require hand-crafted knowledge for every fact
- No compute: networks were too slow to train at any useful scale
- Brittle: systems collapse immediately outside their narrow training domain
Why AI Failed (Institutional)
- Overpromised: researchers claimed 5โ10 year timelines to funders, repeatedly
- Underfunded follow-through: funding cut the moment hype peaked
- Misaligned incentives: demos optimised for impressiveness, not robustness
- The AI Effect: as each thing worked, it stopped being called AI
| Winter | Trigger | What Collapsed | Root Cause | What Survived |
|---|---|---|---|---|
| First (1974โ80) | Lighthill Report (UK, 1973); DARPA cuts | General-purpose AI, machine translation, symbolic reasoning | Combinatorial explosion; overpromised timelines to funders | Specialised systems; early expert system research; Prolog |
| Second (1987โ93) | Expert system maintenance failures; Lisp machine market collapse | Commercial expert systems; DARPA Strategic Computing Program | Expert systems too brittle and expensive to maintain at real-world scale | Backpropagation (1986 rediscovery); statistical ML; RL foundations |
Every AI winter was caused by the same three gaps: not enough DATA, not enough COMPUTE, and not enough ALGORITHMIC insight. By 2012, all three gaps had been filled simultaneously for the first time in history.
While AI winters suppressed funding, a quieter revolution was accumulating. Statistical machine learning โ SVMs, random forests, gradient boosting โ replaced brittle symbolic systems with principled, mathematically-grounded methods. Meanwhile, the web was producing data at a scale no previous generation could have imagined, and GPU hardware was becoming cheap enough to train meaningfully large networks.
Data
Web-scale datasets: ImageNet (1.2M images), Common Crawl (trillions of words), user-generated content from social platforms. Training data grew 1000ร in a decade โ providing the signal deep networks needed to learn.
Compute
NVIDIA GPUs and CUDA (2007) gave 100โ1000ร speedup for matrix operations via massive parallelism. What took weeks on CPUs took hours on GPUs. Training deep networks became economically viable for the first time.
Algorithms
ReLU activation (solved vanishing gradients), Dropout regularisation (prevented overfitting), Batch Normalisation (stabilised training), residual connections (enabled very deep networks). Each independently helpful; together transformative.
AlexNet's ImageNet victory in 2012 was the moment the field changed permanently. Krizhevsky, Sutskever, and Hinton's network achieved 15.3% top-5 error vs. the runner-up's 26.2% โ an 11-point gap that shocked the computer vision community. The model was trained on two consumer GPUs in five days. Within 18 months, every major research lab had pivoted entirely to deep neural networks. The pattern: one shocking result, then near-universal adoption.
| Year | Event | Why It Matters |
|---|---|---|
| โ 2012 | AlexNet โ Hinton, Krizhevsky, Sutskever | Wins ImageNet by 10.8-point margin. GPU-trained CNN. Every major lab pivots to deep learning within 18 months. The modern AI era begins here. |
| 2013 | Word2Vec โ Mikolov (Google) | Word embeddings: words as dense vectors. "king โ man + woman โ queen." Language has geometric structure. Foundation of all modern NLP. |
| 2014 | GANs โ Goodfellow et al. | Generative Adversarial Networks: generator vs discriminator. First system to produce photorealistic images. The generative AI era begins conceptually. |
| 2015 | ResNet โ He et al. (Microsoft) | 152-layer network wins ImageNet. Residual skip connections solved vanishing gradients at depth. Made arbitrarily deep networks trainable. |
| โ 2016 | AlphaGo โ DeepMind | Beats Lee Sedol 4-1 at Go โ more positions than atoms in the observable universe. MCTS + deep RL. Arrived 10โ20 years ahead of expert predictions. |
| โ 2017 | "Attention Is All You Need" โ Vaswani et al. | The Transformer architecture. Self-attention replaces RNNs entirely. Processes all tokens in parallel. Ancestor of GPT, BERT, DALL-E, Whisper, AlphaFold โ every major AI system built since 2018. |
| 2018 | BERT & GPT-1 โ Google & OpenAI | Pre-train on unlabelled text โ fine-tune on tasks. NLP benchmark records shattered across the board. The pre-train โ fine-tune paradigm established. |
| โ 2020 | GPT-3 โ 175B parameters | Few-shot learning at scale: give three examples in the prompt, model solves the task without fine-tuning. Foundation model era begins. OpenAI initially called it "too dangerous to release." |
| 2021 | DALL-E, CLIP, AlphaFold 2 | Multimodal AI: images and text in the same embedding space. AlphaFold 2 solves protein structure prediction โ a 50-year open problem in biology โ in a single paper. |
The Transformer deserves a dedicated note. "Attention Is All You Need" (Vaswani et al., 2017) replaced recurrent networks with a single elegant mechanism: self-attention, which allows every token to attend to every other token in parallel. This unlocked two things simultaneously โ full context access (no sequential bottleneck) and massive parallelism (scale with compute). The result: BERT, GPT, T5, LLaMA, Whisper, DALL-E, AlphaFold, and virtually every transformative AI system since 2018 is built on Transformers. It is arguably the most consequential single paper in AI history.
ChatGPT launched on November 30, 2022. It reached 1 million users in 5 days and 100 million users in 60 days โ the fastest consumer technology adoption in recorded history. Instagram took 2.5 years to reach the same milestone. Twitter took 5 years. This wasn't a product launch โ it was a cultural event. For the first time, a general-purpose AI system was accessible to anyone with a browser, and it worked well enough to be genuinely useful for everyday tasks.
| When | Event | Why It Matters |
|---|---|---|
| โ Nov 2022 | ChatGPT โ OpenAI | 100M users in 60 days โ fastest consumer adoption ever. RLHF alignment made LLMs genuinely helpful. AI moved permanently into mainstream public discourse. |
| โ Mar 2023 | GPT-4 โ OpenAI | Multimodal (text + images). Near-human on bar exam, medical licensing, and professional benchmarks. Set the capability standard that triggered the multi-model race. |
| Mar 2023 | Claude (Anthropic) & Gemini (Google) | Multi-model competitive landscape emerges. Constitutional AI alignment (Anthropic) and multimodal training at Google scale. No single company dominates โ the race is on. |
| โ Jul 2023 | LLaMA 2 โ Meta (open weights) | Open-source LLMs publicly released. Anyone with a GPU can run, fine-tune, or modify frontier-class models. Democratised AI development and sparked the open-source ecosystem. |
| โ 2024โ2026 | Reasoning Models & Agentic AI | o1, o3, DeepSeek-R1: models that think before answering. Claude 3.7, Gemini 2: long-context, multimodal, agentic. Computer-use agents. Physical AI (humanoid robots + LLMs). Current frontier. |
Unlike the boom-bust cycles of the past, the current AI wave is reinforced by a self-amplifying economic flywheel. Better models attract more users. More users generate more revenue and data. More revenue funds more compute. More compute trains better models. The cycle has no obvious brake โ which is precisely why the pace of improvement has been relentlessly accelerating since 2012, and why projection-based estimates consistently underestimate where we will be in three years.
The flywheel has been spinning faster every year since 2012. Training compute for frontier models has grown roughly 4ร per year. GPT-4's training run cost an estimated $100M. Next-generation frontier models are projected at $1B+. This isn't reckless spending โ the returns on better models are large enough that the economics reinforce continued investment. Unlike the AI winters, there is no plausible external shock that would stop this cycle โ only the discovery of fundamental capability limits, which has not yet materialised.
- Two AI winters caused by the same three gaps: insufficient data, compute, and algorithms โ not bad science
- AlexNet (2012) and Transformer (2017) are the two true inflection points of modern AI
- ChatGPT (Nov 2022) โ 100M users in 60 days โ made AI mainstream overnight; Instagram took 2.5 years
- Expert systems failed because knowledge cannot be fully encoded as rules โ ML learns it from data instead
- The AI flywheel: better models โ more investment โ more compute โ more data โ better models
- Understanding AI winters is essential for evaluating today's hype with appropriate skepticism and calibration
- 2024โ2026: reasoning models + agentic AI = the current research and product frontier
Symbolic AI is commonly omitted from modern documentation. This is a mistake. Understanding why symbolic AI dominated for 30 years โ and exactly why it failed to scale โ is what explains why neural networks won. More importantly, it reveals what neural networks still cannot do.
The philosopher John Haugeland coined the term GOFAI (Good Old-Fashioned AI) in 1985 to describe the dominant paradigm that had governed AI research since Dartmouth. Its core assumption: "Intelligence is symbol manipulation according to rules." Feed a machine symbols representing the world, give it logical rules for manipulating those symbols, and intelligence will emerge. The physical symbol system hypothesis (Newell & Simon, 1976) formalised this: "A physical symbol system has the necessary and sufficient means for general intelligent action." This claim was bold, wrong in the absolute sense, and enormously productive for 30 years.
Two schools developed within GOFAI. The logic-based school (Newell, Simon) grounded AI in formal logic and theorem proving โ intelligence as deduction. The knowledge-based school (Minsky, McCarthy) focused on representing domain knowledge richly enough that a system could reason over it โ intelligence as structured lookup and inference. Both shared the same fatal assumption: that the hard part of intelligence is the reasoning engine, not the knowledge itself.
- Explicit rules written by human experts
- Fully interpretable โ you can read the logic
- Fails hard on edge cases rules don't cover
- No data needed โ rules are manual
- Dominated 1956โ1986
- Patterns learned directly from data
- Black box โ hard to interpret weights
- Robust to variation and noise
- Needs large labelled datasets
- Dominated 1986โpresent
Before an AI system can reason, it must represent knowledge. GOFAI researchers developed four major representational frameworks โ each with a distinct philosophy about what knowledge is and how it should be stored.
Semantic Networks
Nodes = concepts, edges = typed relationships. "Animal โ IS-A โ Mammal โ IS-A โ Dog โ HAS โ Fur". Inheritance flows through IS-A links โ Dog automatically inherits all Mammal properties. Precursor to modern knowledge graphs (Wikidata, Google Knowledge Graph, RAG retrieval indexes).
Frames (Minsky, 1974)
Structured objects with named slots and default values โ essentially a class/struct for knowledge. A "Car" frame has slots: color (default: red), wheels (default: 4), engine: present. Frames support inheritance and overriding. Directly influenced object-oriented programming.
Ontologies
Formal specification of all concepts and relationships within a domain. Medical: SNOMED CT (350,000+ concepts). Web: OWL, RDF standards. The Cyc project (Lenat, 1984โpresent): 25M+ rules encoding common-sense knowledge over 40 years. Modern AI: ontologies power knowledge graphs in RAG pipelines.
Production Rules
IF [condition] THEN [action] โ the basis of all expert systems. Forward chaining: data-driven, fire rules that match current known facts, derive new facts until goal reached. Backward chaining: goal-driven, work backward from desired conclusion to find what must be true. Prolog uses backward chaining natively.
FRAME: Car
slots:
color: default=red, type=string
wheels: default=4, type=integer
engine: default=present, type=boolean
owner: default=nil, type=Person-Frame
FRAME: SportsCar (inherits: Car)
overrides:
turbo: default=true
top_speed: default=250 // km/h, overrides Car default
Expert systems were the first commercially successful AI technology. The idea: interview a domain expert, encode their knowledge as production rules, build an inference engine to fire those rules. In the late 1970s and throughout the 1980s, this actually worked โ inside carefully bounded domains with well-defined rules.
// Rule 52
IF organism.staining_reaction = "gram-negative"
AND organism.morphology = "rod"
AND patient.compromised_host = true
THEN organism.identity = "Pseudomonas" (confidence: 0.6)
| System | Domain | Year | Rules | Key Achievement |
|---|---|---|---|---|
| DENDRAL | Chemistry | 1965 | 500+ | First automated scientific reasoning โ identified organic molecules from mass spectrometry |
| MYCIN | Medical diagnosis | 1972 | 600 | 65% accuracy vs 42% for Stanford medical students. Never deployed โ liability concerns. |
| XCON / R1 | Computer configuration | 1980 | 2,500 | Saved DEC $40M/year configuring VAX systems. Proved AI had real commercial ROI. |
| PROSPECTOR | Mineral exploration | 1978 | 1,000 | Discovered a $100M molybdenum deposit. Geological knowledge + probabilistic inference. |
| Cyc | Common-sense reasoning | 1984 | 1M+ | Still maintained. World's largest manually encoded knowledge base. Represents the limit of the approach. |
What Expert Systems Got Right
- Formal representation of domain knowledge works within bounds
- Explainable by design โ every decision traceable to a rule
- Can capture rare cases that training data won't cover
- Still used in medical, legal, and financial rule engines today
Why Expert Systems Failed
- Knowledge bottleneck: experts can't articulate their tacit knowledge as rules
- Brittleness: catastrophic failure on any case the rules don't cover
- Maintenance nightmare: every edge case needs a manual rule update
- Can't learn: no mechanism to improve from new data or outcomes
Logic is the mathematics of valid inference. GOFAI researchers hoped to give AI the ability to reason with the same rigor as formal proof โ deriving new knowledge from existing knowledge with absolute certainty. Three levels of expressive power matter for AI.
Propositional Logic
True/false propositions with connectives: AND (โง), OR (โจ), NOT (ยฌ), IMPLIES (โ). Fully decidable. Too limited: can't express properties of objects or quantify over them.
P โ Q // if it rains, ground is wet
P // it rains (given)
โด Q // modus ponens: ground is wet
First-Order Logic (FOL)
Adds variables, predicates, and quantifiers. Used in theorem provers, Prolog, OWL ontologies. Semi-decidable โ powerful but computationally expensive.
Human(Socrates) // Socrates is human
โด Mortal(Socrates) // universal instantiation
Non-Monotonic Reasoning
Classical logic: once proved, always true. Real world: "Birds fly... unless they're penguins." Non-monotonic reasoning handles exceptions and defaults โ conclusions can be retracted when new information arrives. Essential for commonsense AI.
human(socrates). human(plato).
mortal(X) :- human(X). /* โX: human(X) โ mortal(X) */
/* ?- mortal(socrates). โ true */
/* ?- mortal(zeus). โ false (not in KB) */
bird(tweety). flies(X) :- bird(X). /* birds fly by default */
penguin(opus). bird(opus) :- penguin(opus).
flies(X) :- penguin(X), !, fail. /* penguins don't โ non-monotonic exception */
If knowledge representation is how GOFAI stores the world, search is how it reasons through it. AI as search through a state space was the dominant problem-solving paradigm from 1956โ1986. Define a state (current configuration), a set of actions (legal transitions), and a goal (desired configuration). The AI finds a sequence of actions from start to goal. Simple in principle โ catastrophically hard in practice as state spaces grow.
Uninformed Search
- BFS: explore layer by layer โ finds shortest path, memory expensive O(bd)
- DFS: explore deep first โ memory efficient O(d), may not find shortest path
- Iterative Deepening DFS: optimal like BFS, memory like DFS โ best of both worlds
Informed (Heuristic) Search
- A*: f(n) = g(n) + h(n) โ actual cost + heuristic estimate. Optimal if h is admissible (never overestimates actual cost)
- Greedy best-first: h(n) only โ fast but not optimal
- Used in: pathfinding, GPS navigation, game AI, robot motion planning
| Algorithm | Complete? | Optimal? | Time | Space | Best For |
|---|---|---|---|---|---|
| BFS | โ Yes | โ Yes (unweighted) | O(bd) | O(bd) | Shortest path, small state spaces |
| DFS | โ Yes | โ No | O(bm) | O(bm) | Memory-constrained, finding any path |
| A* | โ Yes | โ Yes (admissible h) | O(bd) | O(bd) | Pathfinding with good heuristic |
| Minimax | โ Yes | โ Yes | O(bm) | O(bm) | 2-player zero-sum games |
| Alpha-Beta | โ Yes | โ Yes | O(bm/2) | O(bm) | Games โ 2ร faster with pruning |
GOFAI wasn't bad science โ it produced real results in bounded domains. It failed because three fundamental problems proved impossible to solve within its paradigm. These aren't engineering problems. They are conceptual limits on what rule-based symbol manipulation can express.
The Frame Problem
McCarthy & Hayes (1969): when an action occurs, what changes and what stays the same? "I move a cup โ does the coffee stay in it? Does the table change? Does gravity still apply?" Humans know intuitively what's relevant. Encoding this completeness is impossibly hard in a finite rule set.
Knowledge Bottleneck
Expert knowledge is largely tacit โ "know-how" not "know-that." Doctors, chess players, and engineers can't fully articulate the pattern recognition driving their decisions. This tacit knowledge is the data that neural networks learn from examples โ it can't be extracted through interviews.
Brittleness & Explosion
Symbolic systems fail immediately outside their defined domain โ no graceful degradation. Real-world ambiguity can't be captured by finite rules. 1,000 rules produce millions of interaction cases. The combinatorial explosion of exceptions makes complete rule coverage computationally intractable.
The lesson of GOFAI: Intelligence is not stored in explicit rules. It emerges from exposure to experience. This insight is the entire foundation of machine learning โ neural networks don't fail on the frame problem because they simply learn what matters from data.
- GOFAI assumption: intelligence = symbol manipulation by rules โ worked in narrow domains, failed at scale
- Knowledge representation: semantic networks, frames, ontologies, production rules โ all have modern descendants
- Expert systems worked commercially (XCON, MYCIN) but hit the knowledge acquisition bottleneck
- FOL & Prolog: powerful for formal reasoning, but undecidable and computationally expensive at scale
- A* search: optimal heuristic pathfinding โ still used in robotics, navigation, and game AI today
- Symbolic AI's failure proved that intelligence must be learned from experience, not encoded as rules
Turingโs 1950 paper opens with โI propose to consider the question, Can machines think?โ โ then immediately sidesteps it. The question matters not because we can answer it, but because our answer determines what we build and how we align it. These are not idle philosophical puzzles.
Alan Turingโs 1950 paper โComputing Machinery and Intelligenceโ is the founding document of AI philosophy. He proposed the Imitation Game as a pragmatic replacement for the unanswerable โCan machines think?โ A human interrogator communicates via text with two participants โ one human, one machine. If the interrogator cannot reliably distinguish which is which, the machine has passed. Turing predicted that by 2000, a machine would fool 30% of judges after 5 minutes. He was roughly right about the timeline, but the test itself proved easier to pass than it was to mean anything.
Critique 1: Behaviour โ Intelligence
A system could pass using clever tricks without understanding anything. ELIZA (1966) fooled users into thinking they were talking to a therapist using simple pattern matching. GPT-4 arguably passes the test today โ does that mean it โthinksโ?
Critique 2: Wrong Benchmark
A bat navigates in darkness via echolocation โ we don't test AI on that. Why is human-style conversation the benchmark for ALL intelligence? AlphaGo is superhuman at Go but would fail the Turing Test. The test is anthropocentric by design.
Critique 3: The Chinese Room
John Searle argues a system could pass the test without understanding anything. The most famous critique of the Turing Test โ see Section 2 for the full argument.
Has Anything Passed It?
Eugene Goostman (2014) claimed to pass โ controversy ensued (judges were lenient). Modern LLMs like GPT-4 routinely fool humans in short conversations. The test itself may now be obsolete as a meaningful benchmark.
| Test / Benchmark | What It Measures | Year | Current Status |
|---|---|---|---|
| Turing Test | Language imitation โ can machine fool a human? | 1950 | Easily gamed by modern LLMs โ not a meaningful bar |
| Winograd Schema | Common-sense reasoning via pronoun resolution | 2011 | LLMs solved it by 2022 โ retired as a benchmark |
| ARC Challenge | Novel visual pattern reasoning | 2019 | GPT-4 scores ~85% โ approaching human level |
| MMLU | Knowledge across 57 academic subjects | 2020 | GPT-4 ~87%, human expert ~89% โ essentially matched |
| BIG-Bench Hard | 23 hard multi-step reasoning tasks | 2022 | Frontier reasoning models approaching human performance |
In 1980 philosopher John Searle published โMinds, Brains, and Programsโ โ arguably the most influential and most debated paper in AI philosophy. His thought experiment: you are locked in a room. Through a slot, people pass slips of paper with Chinese characters. You have a rulebook: โWhen you see symbol sequence X followed by Y, write Z and pass it back.โ You follow the rules perfectly. From outside, your responses are indistinguishable from those of a fluent Chinese speaker. But you understand nothing. You are just manipulating symbols according to formal rules.
Searleโs conclusion: syntax is not sufficient for semantics. A program that processes symbols according to rules โ no matter how sophisticated โ does not thereby understand those symbols. It has the form of language processing without the content. By analogy: AI programs process language according to learned statistical rules. They produce outputs indistinguishable from understanding. But this doesnโt mean they understand.
Systems Reply
Itโs not the person who understands โ itโs the whole system (person + rules + room). Similarly, a single neuron doesnโt understand, but the brain does. You must evaluate the system, not its components in isolation.
Robot Reply
Put the room in a robot with sensors and actuators. Now symbols are connected to real-world referents โ โfireโ is associated with heat sensors. Grounding in the physical world might be what produces genuine understanding.
Brain Simulator Reply
What if the rulebook simulated every neuron in a Chinese speakerโs brain exactly? Would that produce understanding? If not, then biological neurons have no special status either โ and consciousness is unexplained by either substrate.
Searleโs Counter-Reply
All these replies move the lack of understanding around without eliminating it. The systems that result still only process symbols formally โ there is no genuine intentionality. Intentionality requires biology, Searle argues.
The Chinese Room is not a solved argument. GPT-4 is, in a very real sense, an enormously complex Chinese Room. Whether โunderstandingโ requires something beyond symbol manipulation remains genuinely open โ and the answer determines how seriously we should take AI wellbeing and alignment.
Philosopher David Chalmers (1995) distinguished easy problems of consciousness โ explaining cognitive functions like attention, memory, and behaviour โ from the Hard Problem: why does all this processing feel like something from the inside? Why isnโt cognition just computation happening โin the darkโ, without any inner experience? This question may be permanently beyond empirical investigation, because any physical description of a brain state leaves open why there is subjective experience associated with it.
The Hard Problem
Why does it โfeel like somethingโ to be conscious? Even a perfect physical description of brain activity doesnโt explain qualia โ the redness of red, the painfulness of pain. Chalmers argues this may permanently resist scientific explanation.
Functionalism
Mental states are defined by their functional role, not their physical substrate. If silicon performs the same functional operations as a brain, it thereby has mental states. This view is most sympathetic to the possibility of AI consciousness.
Integrated Information Theory
Consciousness = integrated information ฮฆ (phi). High ฮฆ = rich inner experience. A simple logic gate: ฮฆ โ 0. A human brain: high ฮฆ. Simple feedforward neural networks have very low ฮฆ; richly recurrent systems could have more.
Whether current AI systems are conscious is not a scientific question we can currently answer. It is prudent to neither confidently assert nor confidently deny machine sentience. The honest answer is: we donโt know โ and we donโt yet have the tools to find out.
Stevan Harnad (1990) identified a fundamental problem with purely symbolic AI: symbols in a dictionary are defined in terms of other symbols. โCat: a small domesticated carnivorous mammal with soft furโฆโ โ circular definitions all the way down. For humans, symbols are grounded in sensorimotor experience. You know โredโ because you have seen red. You know โhotโ because you have felt heat. For large language models, symbols are grounded only in other symbols โ the statistical contexts in which words appear across trillions of tokens of text.
- โhotโ = direct thermal sensation experienced via skin receptors
- โredโ = specific wavelength of light experienced visually
- โroughโ = tactile perception from physical touch
- Meaning anchored in sensorimotor interaction with the world
- โhotโ = statistical co-occurrence with โfireโ, โburnโ, โtemperatureโ
- โredโ = co-occurrence with โappleโ, โstop signโ, โbloodโ
- โroughโ = co-occurrence with โsandpaperโ, โtextureโ, โjaggedโ
- Meaning is pattern of word co-occurrence in training corpus only
This is why multimodal models (CLIP, GPT-4V, Gemini) and embodied robotics are active research frontiers โ they attempt to ground language in perceptual or physical experience. Whether statistical grounding in text is โsufficientโ for understanding is precisely the empirical question the field is now testing at scale.
Weak AI (Narrow AI)
Searleโs term: AI that simulates intelligence for specific tasks without genuine understanding. The system behaves as if it understands, but has no intentionality. All current AI is Weak AI โ GPT-4, AlphaGo, image classifiers. The useful and productive engineering view: build systems that work, regardless of philosophical status.
Strong AI
A system that genuinely understands โ not just simulates understanding. Would have beliefs, desires, and genuine intentionality in the philosophical sense. Searle argued this is impossible without biological substrate. Most AI researchers bypass this distinction entirely and focus on capabilities.
The practical view: for engineering purposes, the distinction doesnโt matter. Build systems that work. The question of whether they โtruly understandโ is a philosophical question that doesnโt affect whether your spam filter catches spam โ but it does affect how we think about alignment, rights, and long-term AI governance.
Multiple Intelligences (Gardner)
Eight distinct intelligences: linguistic, logical-mathematical, spatial, musical, bodily-kinaesthetic, interpersonal, intrapersonal, naturalistic. AI today dominates the first two; is largely absent from the last five.
Embodied Cognition
Intelligence is shaped by having a body that interacts with the world. Physical AI (robotics + LLMs) is the active frontier precisely because disembodied language models lack grounding in physical causality.
Cognitive Architectures
ACT-R (Anderson) and SOAR (Laird, Newell) are computational models of human cognition with procedural memory, declarative memory, and attention modules โ making testable predictions verified against human reaction time data.
- The Turing Test measures behavioural imitation, not intelligence โ modern LLMs routinely pass it, rendering it obsolete as a benchmark
- Chinese Room: syntax โ semantics โ symbol manipulation without understanding may not be intelligence; the argument remains unresolved
- The Hard Problem: why is there subjective experience at all? Science cannot currently answer this for biological or artificial systems
- Symbol grounding: LLMs know words from statistical co-occurrence patterns, not sensorimotor experience โ multimodal AI attempts to address this
- Strong AI (genuine understanding) vs Weak AI (behavioural simulation) โ all current AI is Weak AI by Searleโs definition
- These philosophical questions directly motivate alignment research: if AI can have goals and understanding, its objectives must be aligned with human values
PEAS, environment types, and the agent taxonomy โ the conceptual framework that connects classical AI to modern LLM agents. Russell & Norvig's Artificial Intelligence: A Modern Approach (AIMA) defines AI as the study of agents that perceive their environment and act to maximise their performance measure. This framework is the conceptual backbone of Domain 8 (Agentic AI) โ understanding it now is essential.
From Russell & Norvig's Artificial Intelligence: A Modern Approach โ the dominant textbook in AI education โ an agent is "anything that perceives its environment through sensors and acts upon that environment through actuators." The word "agent" is deliberately broad: it covers thermostats, chess programs, autonomous vehicles, and GPT-4 equally.
What makes an agent rational? For each possible percept sequence, a rational agent should select an action expected to maximise its performance measure, given the evidence provided by the percept sequence and whatever built-in knowledge the agent has. Rationality โ omniscience. Rationality โ perfection. Rationality = expected utility maximisation given available information.
Why does this abstraction matter? Because it unifies everything โ a thermostat perceives temperature and acts by switching heating, GPT-4 perceives your text and acts by generating tokens, a self-driving car perceives the road and acts by steering. Every AI system in this documentation can be analysed through this lens.
Rationality โ omniscience (knowing all outcomes). Rationality โ perfection (always choosing optimally). Rationality = expected utility maximisation given available information.
PEAS stands for Performance measure, Environment, Actuators, Sensors. It is the standard tool for formally specifying any AI agent task. Before building any AI system, PEAS forces you to answer four questions: What counts as success? What will the agent interact with? How can it act? What can it perceive?
| Agent | Performance Measure | Environment | Actuators | Sensors |
|---|---|---|---|---|
| Self-driving car | Safe arrival, comfort, legal compliance, efficiency | Roads, traffic, pedestrians, weather, other vehicles | Steering wheel, brakes, accelerator, horn | Cameras, LiDAR, GPS, radar, odometer |
| Medical diagnosis AI | Correct diagnosis, patient safety, cost efficiency | Patient data, records, lab results, imaging | Treatment recommendation, flag urgent cases | EMR systems, lab APIs, imaging APIs, notes |
| ChatGPT / LLM Agent | Helpful, harmless, honest; task completion | Human conversation, web, code environment | Text output, API calls, code execution, file writes | Text input, tool outputs, user feedback |
| Chess-playing AI | Win games, minimise blunders | Chess board, opponent's moves | Move selection (display update) | Board state (current position) |
Not all AI environments are created equal. The type of environment an agent operates in determines which algorithms are suitable, how much memory the agent needs, and how complex its decision-making must be. AIMA identifies six key dimensions along which environments vary.
| Property | Definition | Example: Fully X | Example: Partially X | Impact on Agent Design |
|---|---|---|---|---|
| Observable vs Partial | Can agent sense complete state? | Chess (full board visible) | Poker (opponent cards hidden) | Partial โ needs memory and belief states |
| Deterministic vs Stochastic | Are outcomes predictable? | Chess (moves are certain) | Self-driving (pedestrians random) | Stochastic โ needs probability reasoning |
| Episodic vs Sequential | Do past actions affect future? | Image classification (each independent) | Chess / Conversation (history matters) | Sequential โ needs memory |
| Static vs Dynamic | Does environment change while agent thinks? | Crossword puzzle | Real-time trading | Dynamic โ needs fast decisions |
| Discrete vs Continuous | Finite vs infinite states/actions? | Chess (countable positions) | Driving (continuous steering) | Continuous โ needs function approximation |
| Single vs Multi-agent | One or many agents? | Sudoku solver | Multiplayer games, stock market | Multi-agent โ must model other agents |
Most Challenging: Partial + Stochastic
Real-world AI tasks are almost always both. A self-driving car can't see around corners (partial) and pedestrians behave unpredictably (stochastic). This combination is why AI for physical environments is so hard.
Most Impactful: Sequential
If past matters for future โ which it does in almost all useful tasks โ the agent needs memory. This is why LLMs have context windows and why agents need long-term memory stores.
Emerging: Multi-agent
Modern AI increasingly involves multiple agents. LLM orchestration (LangGraph, AutoGen, CrewAI) is multi-agent by design. Critical for Domain 8: Agentic AI.
Russell & Norvig define five progressively sophisticated agent architectures. Each adds a new capability layer. Understanding this hierarchy maps cleanly to the spectrum from a smoke detector to a fully autonomous AI assistant.
โ Simple Reflex Agent
Condition-action rules mapping current percepts directly to actions. No memory. No history. Fast and auditable but completely brittle โ fails whenever current percept doesn't capture all relevant state.
- Thermostat, smoke detector, basic spam filter
- Limitation: useless in partially observable environments
โก Model-Based Reflex Agent
Maintains an internal state โ a model of the unobservable parts of the world. Updates state based on percepts over time. Can handle partial observability that pure reflex agents cannot.
- SLAM robot, Roomba tracking its path
- Limitation: rules still don't reason about goals
โข Goal-Based Agent
Has an explicit goal and searches for action sequences to achieve it. Uses search algorithms (A*, BFS) to plan multiple steps ahead. More flexible โ multiple paths to the same goal.
- GPS navigation, STRIPS planner, theorem provers
- Limitation: all goals binary โ doesn't handle tradeoffs
โฃ Utility-Based Agent
Uses a utility function โ graded preferences between states, not just goal/no-goal. Maximises expected utility (probability ร value). Handles stochastic outcomes naturally.
- Recommendation systems, portfolio managers, RL agents
- Foundation of decision theory and modern RLHF for LLMs
โค Learning Agent
Any agent type augmented with a learning component that modifies behaviour based on experience. Four internal components: performance element (selects actions), critic (evaluates against a standard), learning element (improves performance), problem generator (suggests exploratory actions). All modern AI systems are learning agents โ GPT learned from text, AlphaGo Zero learned from self-play.
Before an agent can search for a solution, it must formally define the problem. AIMA's five-component problem formulation is the standard method. It forces precision: what exactly is a "state"? What exactly is an "action"? What exactly is the "goal"?
| Element | Definition | Example: 8-Puzzle | Example: GPS Navigation |
|---|---|---|---|
| Initial state | Starting configuration | Random tile arrangement | Current location |
| Actions | Set of possible moves from each state | Slide tile left, right, up, down | Turn left/right, go straight |
| Transition model | Result of taking each action in each state | New tile arrangement after sliding | New position after movement |
| Goal test | Determines if current state is goal | Tiles in order 1โ8, blank at bottom-right | Current location = destination |
| Path cost | Numeric cost of a path | Number of moves taken | Distance or estimated travel time |
- Agent = anything that perceives and acts. PEAS (Performance, Environment, Actuators, Sensors) formally specifies any agent task
- Environments are: observable/partial, deterministic/stochastic, episodic/sequential, static/dynamic, discrete/continuous, single/multi-agent
- Partially observable + stochastic = most real-world tasks. This is why AI for physical environments is so hard.
- Five agent types: Reflex โ Model-based โ Goal-based โ Utility-based โ Learning โ each adds a capability layer
- State-space search: define initial state, actions, transitions, goal test, path cost โ then search. Combinatorial explosion limits this approach at scale.
- This framework directly maps to modern LLM agents (Domain 8): percepts = tool outputs + messages, actions = tool calls + text generation
AI is not one field โ it is a collection of competing intellectual traditions with different assumptions about what intelligence is and how to build it. Understanding the camps explains why researchers from different traditions argue past each other, and why hybrid approaches are gaining traction.
Connectionism is the paradigm that won. Inspired by the structure of the biological brain, it proposes that intelligence emerges from networks of simple connected units โ and that knowledge is stored not in explicit rules, but in the strengths of connections between units. A single neuron knows nothing; billions of them, connected with learned weights, produce language, vision, and reasoning.
The lineage runs from McCulloch & Pitts (1943) โ the first mathematical neuron โ through Rosenblatt's Perceptron (1957), through Rumelhart & Hinton's backpropagation (1986) which made multi-layer training practical, through LeCun's LeNet (1998) and AlexNet (2012), all the way to the Transformer (2017) and the LLM era. Every step was the same insight applied at larger scale with more data and compute.
Key Properties
- Parallel distributed processing โ thousands of units compute simultaneously
- Graceful degradation โ partial damage reduces performance gradually, not catastrophically
- Learning from examples โ weights adapt through exposure, not programming
- Statistical regularities โ finds patterns in data regardless of whether humans can articulate them
The Connectionist Timeline
- 1943 โ McCulloch-Pitts neuron: first mathematical model
- 1957 โ Rosenblatt Perceptron: first learning machine
- 1986 โ Rumelhart & Hinton: backpropagation for multi-layer nets
- 1998 โ LeCun's LeNet: CNNs on handwritten digits
- 2012 โ AlexNet: GPU-trained deep CNN wins ImageNet
- 2017 โ Transformer: attention replaces recurrence entirely
- 2022+ โ LLMs: trillion-parameter connectionist systems
For three decades the battle between symbolicism and connectionism was not a polite academic disagreement โ it was personal, bitter, and high-stakes. Minsky's famous dismissal of perceptrons (1969) killed neural network funding for a decade. Connectionists returned fire in 1986 with backpropagation. The symbolicism camp called neural networks "black box statistics". The connectionist camp called expert systems "fragile toy programs". Funding, careers, and the direction of the entire field were at stake.
- Intelligence = symbol manipulation according to rules
- Knowledge = explicit, human-readable rules and facts
- Interpretable by design โ you can read the logic
- Works with zero data โ rules are programmed in
- Generalises by logical deduction from axioms
- Can solve novel problems if rules cover them
- Brittle โ fails hard outside the defined domain
- Intelligence = statistical pattern matching in weights
- Knowledge = distributed across millions of parameters
- Black box โ learned weights are not human-readable
- Requires large labelled datasets to train
- Generalises by interpolation within training distribution
- Struggles with truly novel, out-of-distribution problems
- Robust โ degrades gracefully under noise and variation
Symbolicism's Best Argument
"Neural networks can't do compositional reasoning. They can't understand that 'the dog bit the man' is fundamentally different from 'the man bit the dog' โ they just see statistical co-occurrences. Language has recursive structure that requires rules, not statistics. Neural networks that can't count reliably can't be the basis of real intelligence."
Connectionism's Best Argument
"Where are the rules for understanding a face? For recognising speech across different accents? For generating coherent prose? The rules are impossibly complex to write โ the only viable path is to learn from data. The brain itself is a neural network, not a rule engine. Evolution didn't write logic programs โ it tuned connection weights."
The debate was largely resolved by the empirical success of deep learning in the 2010s. But symbolicism's core insight โ that structured, compositional reasoning matters โ has returned in the form of neuro-symbolic AI, chain-of-thought prompting, and structured reasoning models like o1 and DeepSeek-R1.
| Dimension | Symbolicism | Connectionism | Current Status (2026) |
|---|---|---|---|
| Core claim | Intelligence = symbol manipulation by rules | Intelligence emerges from connected simple units | Connectionism dominant; hybrid gaining |
| Key figures | McCarthy, Minsky, Newell, Simon | Rosenblatt, Rumelhart, Hinton, LeCun, Bengio | Hinton, LeCun, Bengio โ 2018 Turing Award |
| Strengths | Interpretable, logically consistent, structured | Learns from data, robust, generalises well | Both needed; neither sufficient alone |
| Weaknesses | Brittle, doesn't scale, knowledge bottleneck | Black box, data-hungry, fails OOD | Interpretability & robustness unsolved |
| Modern form | Knowledge graphs, formal verification, SMT solvers | LLMs, diffusion models, transformers | Neuro-symbolic = active frontier |
The third major paradigm takes a different starting point: the world is fundamentally uncertain, and any intelligent system must represent and reason about that uncertainty explicitly. Where symbolicism asks "what is true?", probabilistic AI asks "how confident am I, and how should new evidence update that confidence?" This is Bayes' theorem as the engine of intelligence: P(hypothesis | evidence) โ P(evidence | hypothesis) ร P(hypothesis).
Bayesian AI
Models uncertainty explicitly as probability distributions over hypotheses. Bayes' theorem: P(H|E) โ P(E|H) ร P(H). New evidence updates your prior belief to a posterior. Principled โ but computing exact posteriors is often intractable, requiring approximations: MCMC, variational inference, Laplace approximation.
Applications: spam filtering, medical diagnosis, sensor fusion, A/B testing, causal inference
Probabilistic Graphical Models
Bayesian networks: directed acyclic graphs encoding conditional dependencies โ efficient inference via variable elimination. Hidden Markov Models (HMMs): hidden states with observable outputs โ dominated speech recognition 1980sโ2010s. Gaussian Processes: non-parametric Bayesian learning with uncertainty estimates.
Still used: robotics (SLAM), bioinformatics, probabilistic programming (Stan, PyMC, Pyro)
Probabilistic AI never dominated the way symbolicism or connectionism did, but it remains indispensable in specific contexts. Robotics relies heavily on Bayesian state estimation (Kalman filters, particle filters). A/B testing and causal inference are Bayesian at their core. Modern LLM calibration โ how well a model's confidence scores match actual accuracy โ is a probabilistic question. RLHF uses Bayesian reasoning about reward models. The probabilistic paradigm's contribution is the rigorous treatment of uncertainty that pure neural approaches often lack.
How It Works
- Maintain a population of candidate solutions
- Evaluate each against a fitness function
- Select the fittest for reproduction
- Apply crossover (recombination) and mutation
- Repeat โ no gradients required
- Powerful where gradient descent fails: discontinuous objectives, black-box functions, combinatorial spaces
Modern AI Applications
- Genetic Algorithms โ solutions as bit-string chromosomes, selection pressure
- Evolution Strategies (ES) โ optimise continuous parameters; OpenAI showed ES competes with RL backprop
- NEAT โ neuroevolution: evolves both topology and weights of neural networks
- Neural Architecture Search (NAS) โ discovers superior architectures (MobileNet, EfficientNet)
- Hyperparameter optimisation โ black-box search over learning rates, depths, widths
Rodney Brooks & the Core Thesis
- MIT, late 1980s: "Intelligence without Representation" (1991)
- Explicit world models are unnecessary โ and may be counterproductive
- Subsumption architecture: layered behaviours, each suppressing lower ones
- Behaviours: avoid obstacles โ wander โ follow walls โ seek light
- Result: surprisingly robust behaviour with no search, no world model
- Contrast: GOFAI built maps of the world; Brooks robots just acted in it
Why It Matters for Modern AI
- Intelligence emerges from physical interaction with the world, not just computation
- RT-2 (Google/DeepMind) โ LLM reasoning + embodied robot actions
- ฯโ (Physical Intelligence) โ foundation model for physical manipulation
- Symbol grounding (Ch 1.4) may require embodiment to solve
- An AI that has felt heat and grasped objects has symbols grounded in experience
- Pure text-trained LLMs lack this physical grounding entirely
The frontier of AI research in 2026 is increasingly defined by the attempt to combine the strengths of both paradigms: neural perception and fluency with symbolic precision and compositionality. Neither pure approach is sufficient โ neural networks hallucinate and fail at systematic reasoning; symbolic systems can't perceive raw inputs or handle uncertainty. The hybrid is not a compromise โ it is a new paradigm that can do things neither can do alone.
Why Pure Neural Struggles
- Hallucination โ generates confident falsehoods with no symbolic grounding
- Arithmetic errors โ counting and algebra are unreliable in raw LLMs
- Logical inconsistency โ contradicts itself across long contexts
- Rule-following failures โ can't reliably follow strict constraints
- OOD brittleness โ fails on distributions far from training data
Why Neuro-Symbolic Is the Frontier
- Tool-using LLMs โ symbolic precision (calculator, code) + neural fluency
- Chain-of-thought โ structured reasoning steps in natural language
- AlphaGo / AlphaZero โ neural value/policy network + MCTS symbolic search
- AlphaGeometry (2024) โ neural proof-step generator + symbolic verifier, solves IMO problems
- Knowledge Graph + LLM โ structured facts + generative reasoning (RAG)
The five most important neuro-symbolic integrations in production AI today:
- โ Tool use โ LLMs call Python interpreters, calculators, and search APIs for symbolic precision. Neural fluency + symbolic correctness.
- โกChain-of-thought prompting โ forcing explicit intermediate reasoning steps mimics symbolic deduction chains. Each step is verifiable.
- โขAlphaGo / AlphaZero โ the neural value network supplies learned intuition; MCTS provides principled symbolic tree search. Neither alone reaches superhuman level.
- โฃRAG (Retrieval-Augmented Generation) โ a knowledge graph or vector store (symbolic) is queried by a neural LLM. Grounds generation in verifiable facts.
- โคReasoning models (o1, DeepSeek-R1) โ extended internal "thinking" with verifiable intermediate steps before producing output. The most direct implementation of neuro-symbolic reasoning in frontier LLMs.
- Connectionism: knowledge stored in weights, learned from data โ robust, scalable, but opaque. The dominant paradigm since 2012.
- Symbolicism: knowledge in explicit rules โ interpretable, compositional, but brittle and unable to scale to real-world complexity
- Probabilistic AI: uncertainty as a first-class citizen โ Bayesian networks, HMMs, still essential in robotics and calibration
- Evolutionary computation: selection + mutation โ used in NAS, hyperparameter optimisation, and black-box RL
- Embodied AI: intelligence emerges from physical interaction โ motivates robotics frontier and addresses symbol grounding
- Modern frontier is neuro-symbolic: neural perception + symbolic reasoning โ chain-of-thought, tool use, AlphaGo, AlphaGeometry
This chapter connects Domain 1's history and theory to the present, and gives you a map of where each subsequent domain fits. AI in 2026 is defined by foundation models as the new default paradigm โ but with hard limitations that motivate everything that follows in this curriculum.
The most important structural shift in AI since 2020 is the emergence of Foundation Models โ large models pre-trained on broad, internet-scale data that can be adapted to almost any downstream task via prompting, fine-tuning, or RLHF. The old paradigm: train a separate model for each task (one for translation, one for summarisation, one for classification). The new paradigm: one model does all of them, often better than the specialised predecessors.
This shift was driven by neural scaling laws (Kaplan et al., 2020): performance on downstream tasks improves predictably as a power-law function of compute, parameters, and data โ with no sign of diminishing returns until very recently. The implication: more scale reliably buys more capability, making the economics of frontier model training self-reinforcing.
Scale
Billions to trillions of parameters. Trained on internet-scale data across months on thousands of GPUs. GPT-4 estimated $100M+ to train. Next-generation frontier models: estimated $1B+. The compute requirements double every 6โ12 months.
Emergence
Capabilities that weren't explicitly trained emerge at scale โ arithmetic, code generation, multi-step reasoning, in-context learning. Below a capability threshold: zero competence. Above it: sudden, surprising ability. Emergent capabilities are scale's most striking phenomenon.
Adaptability
One model adapts to hundreds of tasks via prompting alone. No task-specific training required for most applications. Fine-tuning (LoRA, QLoRA) and RLHF allow further specialisation when needed. The pre-train โ adapt paradigm replaced train-from-scratch.
AI capability is not uniform across tasks. Understanding where AI already surpasses humans, where it approaches parity, and where it still falls short is essential for calibrating realistic expectations โ and for knowing which research problems remain open.
| Capability | Benchmark | Human Score | Best AI (2026) | Status |
|---|---|---|---|---|
| Image Classification | ImageNet Top-5 | ~95% | 99%+ | โ Surpassed |
| Speech Recognition | LibriSpeech WER | ~5% WER | 2โ4% WER | โ Surpassed |
| Reading Comprehension | SQuAD 2.0 | 89.5 F1 | 93+ F1 | โ Surpassed |
| Professional Knowledge | MMLU | 89% | 87โ90% | โ At parity |
| Code Generation | HumanEval | ~85% (experts) | 90%+ | โ At parity |
| Mathematical Reasoning | MATH (competition) | ~60% (AMC) | 70โ85% | โ Approaching |
| Common-Sense (physical) | PIQA | ~95% | 85โ90% | โ Still behind |
| Long-horizon Planning | Novel open tasks | High | Lowโmedium | โ Still behind |
OpenAI
GPT-4o, o1, o3 series. Dominant commercial position. ChatGPT ~200M+ active users. $13B Microsoft partnership and deep Azure integration. Reasoning models (o1/o3) introduced chain-of-thought as a first-class capability. Controversial governance restructuring in 2024.
Anthropic
Claude 3.5 Sonnet / Opus. Safety-focused, Constitutional AI alignment. $4B Amazon investment + AWS deployment. Long context (200K tokens). Claude 3.7 introduced extended thinking. Preferred by enterprises with compliance requirements.
Google DeepMind
Gemini 1.5 Pro / Ultra. Deep integration with Google Search and Workspace. TPU hardware advantage. AlphaFold 2 solved protein folding. AlphaCode, Gemini robotics (RT-2). Gemini 2.0 Flash: fast multimodal at scale.
Meta AI
LLaMA 3 (open weights: 8B, 70B, 405B). Democratising AI development. The LLaMA ecosystem powers most open-source fine-tuning, quantisation, and deployment. Fuelled the open-source explosion that now competes with closed models.
Mistral AI
European AI champion. Mistral Large, Codestral, Mixtral (mixture-of-experts). Efficient models with open weights. Positioned as privacy-first alternative. GDPR-aligned for European enterprise deployment.
Open Source Community
Hugging Face ecosystem (models, datasets, Spaces). GGUF/llama.cpp local inference. LoRA/QLoRA fine-tuning. DeepSeek-R1 (China, open weights). Qwen (Alibaba). Phi-3 (Microsoft). Democratised deployment without API dependency.
| Dimension | Closed Source | Open Source |
|---|---|---|
| Examples | GPT-4o, Claude 3.5, Gemini 1.5 | LLaMA 3, Mistral, DeepSeek-R1, Qwen |
| Performance | Highest capability at frontier | Increasingly competitive (within 10โ20%) |
| Cost | Per-token pricing ($0.001โ$0.10/1K tokens) | Infrastructure cost only โ near zero at scale |
| Privacy | Data sent to vendor servers | On-premise possible โ full data control |
| Customisation | Limited (fine-tune API, system prompts) | Full weight access โ fine-tune anything |
| Safety | Vendor-managed RLHF alignment | Community-managed โ variable quality |
| Transparency | Minimal (model cards, no architecture details) | Weights + often training details available |
Benchmarks are the field's primary mechanism for measuring progress โ and its primary mechanism for self-deception. The benchmark lifecycle is predictable: a new benchmark is created to test a genuine capability gap โ models improve rapidly โ benchmark saturates โ researchers create a harder benchmark. The cycle repeats every 12โ24 months.
Goodhart's Law in AI: "When a measure becomes a target, it ceases to be a good measure." AI benchmarks are routinely gamed โ through training data contamination, fine-tuning on test distributions, or optimising prompts for specific test formats. Benchmark saturation does not mean a capability is solved. ARC-AGI was designed by Franรงois Chollet specifically to resist shortcut-learning โ it remains the hardest general reasoning evaluation as of 2026.
| Benchmark | Domain | What It Tests | Human Baseline | Status |
|---|---|---|---|---|
| ImageNet | Vision | 1000-class image classification | ~95% | Saturated โ AI >99% |
| GLUE / SuperGLUE | NLP | Language understanding tasks | 87% / 89% | Saturated โ replaced |
| MMLU | Knowledge | 57 academic subjects (multiple choice) | 89% | At parity โ gameable |
| HumanEval | Coding | Python function generation from docstrings | 85% (experts) | Near parity (90%+) |
| MATH | Mathematics | Competition math (AMC/AIME level) | 60% (AMC students) | AI ~70โ85% |
| ARC-AGI | Reasoning | Novel visual pattern abstraction | 85% | AI ~40% โ hardest eval |
| BIG-Bench Hard | Diverse | 23 reasoning-heavy tasks | ~65% | AI 75%+ approaching |
| GPQA (Diamond) | Science | PhD-level biology, chemistry, physics | 65โ80% | AI 60โ70% approaching |
How Benchmarks Get Gamed
- Data contamination: test set data appears in model training corpus
- Distribution fine-tuning: model trained on benchmark training split, tested on test split
- Prompt engineering: optimising prompts for specific test formats inflates scores
- Memorisation: model recalls answers rather than reasoning to them
What Makes a Good Benchmark
- Novel problems: not solvable by pattern-matching training data
- Human-verifiable: ground truth is unambiguous
- Diverse: many task types so no single trick wins
- Dynamic: can be updated as old tasks saturate (e.g. ARC-AGI 2)
Despite the extraordinary progress of the last decade, fundamental problems remain unsolved. These are not engineering challenges awaiting more compute โ they are conceptual gaps that may require new paradigms to address. Understanding them prevents over-hyping current capabilities and points toward where research is most needed.
Hallucination & Factual Reliability
LLMs generate confident falsehoods. Retrieval-augmented approaches reduce but don't solve the problem. Root cause: models are trained to produce fluent, plausible text โ not verified facts. A model that reliably says "I don't know" when it doesn't know would be enormously valuable. Calibrated uncertainty is an open research problem.
Common Sense & Physical Reasoning
A 2-year-old knows a dropped ball falls. AI still fails simple physical scenarios. LLMs absorbed much commonsense from text but miss the causal world model behind it. The Winogrande benchmark reveals systematic errors on pronoun resolution humans find trivial. Physical reasoning requires causal world models โ not statistical text patterns.
Long-Horizon Planning
Current agents handle ~10โ50 step tasks reliably. Real-world projects span hundreds of interdependent steps over days or weeks. Errors compound: one wrong action early invalidates all downstream planning. Agents also lack persistent memory across sessions and suffer context window limits. Autonomous multi-day task completion remains out of reach.
Causal Inference
Current AI is fundamentally correlational. "Countries with more hospitals have more disease" โ a correlational model concludes hospitals cause disease. Judea Pearl's causal hierarchy (association โ intervention โ counterfactual) identifies exactly what's missing. Moving from correlation to causation is essential for reliable decision-making in medicine, policy, and science.
Sample Efficiency
A child learns "dog" from ~10 examples. GPT-4 needed hundreds of billions of tokens to achieve comparable breadth. Humans generalise from far fewer examples via strong inductive biases built by evolution. Few-shot meta-learning addresses this but current models still require vastly more data than humans for comparable generalisation.
Robustness & Distribution Shift
Models trained on one distribution fail when deployed conditions differ. A self-driving model trained in California may fail catastrophically in snow. Adversarial examples โ imperceptible perturbations โ fool classifiers with >99% confidence. Consistent performance across distribution shift remains unsolved and is critical for safety-critical deployment.
- Foundation Models are the new paradigm โ one model pre-trained at scale, adapted to many tasks via prompting or fine-tuning
- Neural scaling laws (Kaplan et al., 2020): performance improves predictably with more compute, parameters, and data โ no ceiling yet found
- AI already surpasses humans in perception and language tasks; approaches parity in reasoning and code; still behind in physical commonsense and long-horizon planning
- The AI ecosystem is bifurcating: closed frontier (GPT-4o, Claude, Gemini) vs open weights (LLaMA 3, Mistral, DeepSeek) โ both converging in capability
- Benchmarks are routinely gamed โ saturation โ capability solved. ARC-AGI remains the hardest cheat-resistant general reasoning eval
- Hard open problems โ hallucination, common-sense reasoning, long-horizon planning, causal inference, sample efficiency โ are conceptual gaps, not just engineering ones
- Ch 1.1 โ AI = optimisation toward a goal. Four capabilities: Perceive, Reason, Learn, Act. All today's AI is ANI โ narrow, task-specific, impressive in domain, brittle outside it.
- Ch 1.2 โ Two AI winters from data/compute/algorithm gaps. AlexNet (2012) + Transformer (2017) are the two true inflection points. ChatGPT (2022) = 100M users in 60 days.
- Ch 1.3 โ GOFAI = symbol manipulation. Expert systems worked commercially (XCON, MYCIN) but hit the knowledge bottleneck. GOFAI failure proved intelligence must be learned, not encoded.
- Ch 1.4 โ Chinese Room: syntax โ semantics. Symbol grounding problem motivates multimodal and embodied AI. Hard Problem of consciousness remains unresolved for biological and artificial systems.
- Ch 1.5 โ PEAS framework describes any agent. Five agent types: Reflex โ Model-based โ Goal-based โ Utility โ Learning. This maps directly to modern LLM agents in Domain 8.
- Ch 1.6 โ Connectionism won empirically. The frontier is neuro-symbolic: neural perception + symbolic reasoning โ chain-of-thought, tool use, AlphaGo, AlphaGeometry.
- Ch 1.7 โ Foundation models = the new paradigm. One model, many tasks. Open problems: hallucination, common-sense, long-horizon planning, causal inference remain unsolved.
Domain 1 gave you the vocabulary and mental models. Every subsequent domain takes one part of this picture and goes deep. Domain 2 gives you the mathematics. Domain 3 the algorithms. Domain 4 the architectures. The pieces will connect โ keep going.