AI Foundation · Domain 01

Foundations of Artificial Intelligence

What AI actually is, where it came from, why symbolic AI failed, the philosophical debates that matter today, the rational agent framework, competing paradigms, and the state of the field.

Chapter 1.1 · Definitions & Scope

What Is Artificial Intelligence?

AI is not a single thing. It is a collection of techniques for building systems that optimise toward a goal using data. The magic disappears when you see the math.

Defining Intelligence In-depth

🧬

Biological Intelligence

Embodied — tied to a body with survival pressures
Developed through evolution over millions of years
Energy-efficient: 20 watts powers the human brain
Generalises from very few examples (few-shot by default)
Handles open-ended, ambiguous, novel situations naturally

🤖

Machine Intelligence

Disembodied — no physical survival pressure
Built through training on human-generated data
Energy-hungry: GPT-4 training cost ~$100M in compute
Requires millions–billions of examples to learn patterns
Brittle outside training distribution; exceptional at defined tasks

What makes a system "intelligent"? There is no consensus. The four most-cited criteria:

Ability to learn from experience
Ability to solve novel problems
Ability to understand and generate language
Ability to reason under uncertainty

No current AI system fully satisfies all four in the general sense humans do.

AI Definitions Core

There is no single universally agreed definition of AI. Here are the four most cited, each emphasising a different aspect:

John McCarthy, 1956

"The science and engineering of making intelligent machines, especially intelligent computer programs."

Russell & Norvig — AIMA 4th Ed.

"The study of agents that receive percepts from the environment and perform actions." The textbook definition.

Practical Engineering (2024)

"Building systems that can perceive, reason, learn, and act — performing tasks that traditionally required human intelligence."

ISO/IEC 22989:2022

"An engineered system that generates outputs such as content, forecasts, recommendations, or decisions for a given set of human-defined objectives."

The common thread across all four: AI systems are designed to optimise for a goal. Not magic — sophisticated optimisation machines trained on data. Whether the goal is to win a chess game, classify an image, or predict the next word in a sentence, the mechanism is the same: minimise a loss, maximise a reward.

"AI is whatever humans haven't figured out how to do yet. Once we have an algorithm for it, we stop calling it AI."

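Popular paraphrase of Tesler's Theorem (Larry Tesler, Xerox PARC, ~1970s), also known as the AI Effect. Tesler's own wording was "Intelligence is whatever machines haven't done yet."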

AI ⊃ Machine Learning ⊃ Deep Learning — nested subfields

Term	Formal Definition	Emerged	Canonical Example
Artificial Intelligence	Any technique enabling machines to mimic intelligent behaviour	1956 — Dartmouth Conference	Chess programs, expert systems
Machine Learning	Algorithms that learn from data without explicit programming	1959 — Arthur Samuel	Decision trees, XGBoost, SVMs
Deep Learning	ML using multi-layer neural networks learning hierarchical representations	~2006 / mainstream 2012	GPT-4, ResNet, AlphaFold, DALL-E
Data Science	Extracting insight from data using statistics, ML, and visualisation	2000s	Business analytics, dashboards

The Four Capabilities In-depth

All AI systems — from the simplest spam filter to GPT-4 — can be understood through four fundamental capabilities. These aren't arbitrary categories: they map directly to the architectural components you'll encounter in every system throughout this curriculum.

👁️

Perceive

Receive input from the environment. Sensors, cameras, microphones, text APIs, database queries. The AI's window onto the world begins here.

Example: GPT-4 Vision reading an image; a self-driving car's LiDAR scanning the road ahead.

🧠

Reason

Analyse, compare, infer, and plan. Apply logic, pattern matching, or learned heuristics to understand input and determine what to do next.

Example: A classifier deciding "this email is spam"; an LLM using chain-of-thought to plan multi-step solutions.

📚

Learn

Update the internal model based on feedback, new data, or experience. This is what separates machine learning from traditional rule-based software.

Example: A model's weights adjusting during gradient descent; a recommender system adapting to user feedback.

⚡

Act

Generate outputs: text, decisions, control signals, code, images. Take actions in the world — and potentially change the environment for the next perception cycle.

Example: GPT generating a response; a robot arm moving to pick up an object.

The Four AI Capabilities — Perceive → Reason → Learn → Act

Perceive is the entry point. In large language models, the perception layer is the tokeniser and embedding lookup — raw text transformed into high-dimensional vectors the network can process. In computer vision systems, it's the image pipeline: resize, normalise, encode. In robotics, it's sensor fusion from cameras, accelerometers, and LiDAR.

Reason sits at the heart of what makes AI feel intelligent. In a transformer model, attention mechanisms allow every token to reason about every other token in the context window. In a decision tree, reasoning is the sequence of feature comparisons that route an input to a leaf node prediction.

Learn is the mechanism that makes modern AI powerful. Traditional software is programmed with explicit rules. Learning systems instead infer rules from examples. A loss function quantifies error, and an optimiser updates parameters to reduce it — repeated millions of times across millions of examples.
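The loss-and-optimiser loop described above can be sketched in a few lines. This is a minimal illustration, assuming a two-parameter linear model, mean squared error as the loss, and plain gradient descent as the optimiser; the function name `train_linear`, the toy data, and the learning rate are illustrative choices, not taken from any particular system.

```python
def train_linear(xs, ys, lr=0.05, steps=2000):
    """Fit y ~ w*x + b by gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Prediction error for every example under the current parameters.
        errors = [(w * x + b) - y for x, y in zip(xs, ys)]
        # Gradients of MSE = (1/n) * sum(error^2) with respect to w and b.
        grad_w = (2.0 / n) * sum(e * x for e, x in zip(errors, xs))
        grad_b = (2.0 / n) * sum(errors)
        # Step against the gradient to reduce the loss.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def mse(w, b, xs, ys):
    """Mean squared error of the line w*x + b on the data."""
    return sum(((w * x + b) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Toy data generated from y = 2x + 1 (an assumption made for this example).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

w, b = train_linear(xs, ys)
print(f"w={w:.3f}, b={b:.3f}, loss={mse(w, b, xs, ys):.6f}")
```

Nothing in the loop encodes the rule "multiply by two and add one"; the parameters drift toward it because that is where the loss is lowest. Real systems differ mainly in scale: millions to billions of parameters, mini-batches instead of the full dataset, and more elaborate optimisers.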

Act closes the loop. For a language model, acting means sampling the next token from a probability distribution. For a recommendation engine, surfacing the top-k items. For a robotic system, issuing motor commands. The action taken modifies the environment, producing new perception data.
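For the language-model case, "sampling the next token from a probability distribution" can be sketched as follows. The vocabulary and the raw scores (logits) are hypothetical stand-ins for what a trained network would output, and `temperature` is the commonly used knob for how adventurous the sampling is.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; lower temperature sharpens the distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(vocab, logits, temperature=1.0, rng=random):
    """Draw one token at random, weighted by its softmax probability."""
    probs = softmax(logits, temperature)
    return rng.choices(vocab, weights=probs, k=1)[0]


# Hypothetical four-word vocabulary and scores for the next position.
vocab = ["cat", "dog", "the", "sat"]
logits = [2.0, 1.0, 0.1, -1.0]

probs = softmax(logits)
rng = random.Random(0)  # seeded so repeated runs draw the same sequence
token = sample_token(vocab, logits, rng=rng)
print([round(p, 3) for p in probs], token)
```

Generating a full response repeats this step: the sampled token is appended to the input and the model is run again, which is the perceive-act loop in miniature.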

A self-driving car PERCEIVES the road, REASONS about obstacles, has LEARNED from millions of miles of training data, and ACTS by steering. ChatGPT PERCEIVES your text, REASONS using learned patterns, and ACTS by generating tokens one at a time.

AI vs ML vs DL vs Data Science Core

These terms are often used interchangeably in the press, but they have precise relationships — each is a subset of the one above. Consider ChatGPT: it is simultaneously correct to call it AI, ML, and Deep Learning — each statement is true and increasingly specific.

The AI Field — Nested Subfields & Overlaps

However, the reverse implication does not hold. Not all AI is ML: expert systems encode human knowledge directly as rules, without learning from data. Not all ML is deep learning: a random forest or a naive Bayes classifier is ML but not deep learning. Data Science is a separate discipline focused on extracting business insight via statistics, visualisation, and ML; it overlaps AI but is not a true subset.

Machine Learning IS:

Systems that learn rules from data without explicit programming
Improving in performance with more and better data
Generalising learned patterns to new, unseen examples
The dominant paradigm powering modern AI products

Machine Learning IS NOT:

Just statistics (it adds architectural inductive biases statistics lacks)
Always neural networks (trees, SVMs, and ensembles are ML too)
Magic or sentient — it is function approximation
Guaranteed to work on any problem without careful engineering

Common misuse: "We use AI" usually means "we use ML" and often specifically means "we use a trained model." Precision matters — especially when evaluating vendor claims. Ask: Is it rule-based? Trained? On what data? Measured by what metric?

Types of AI: ANI, AGI, ASI Core

AI systems can be classified along two axes: scope (what range of tasks?) and capability level (how does it compare to human intelligence?).

🎯

ANI — Narrow AI

Excels at one specific task. Cannot generalise outside its training domain. All AI today is ANI — GPT-4 is brilliant at text but cannot physically navigate a room. AlphaGo beat world champions at Go but cannot play chess.

All AI Today

🧠

AGI — General AI

Can perform any intellectual task a human can. Transfers knowledge across completely different domains without retraining. Not yet achieved. Timeline: 5–20+ years (highly debated). OpenAI's stated mission.

Research Frontier

🚀

ASI — Superintelligence

Surpasses human intelligence in every domain. Could recursively self-improve. Purely theoretical. Motivates AI safety and alignment research today — the "singularity" discussed by Bostrom, Yudkowsky, Tegmark.

Theoretical

The AI Spectrum — ANI → AGI → ASI

✅ What ANI Can Do Well Today

Fluent, coherent text generation at scale
Image classification (often superhuman accuracy)
Code writing, debugging, and refactoring
High-quality language translation
Strategic game playing (Go, Chess, StarCraft II)
Protein structure prediction (AlphaFold)

⚠️ What ANI Still Cannot Do Reliably

Reliable reasoning about novel physical situations
Maintaining consistent long-term plans over time
Transferring skills across domains without retraining
Acting reliably in open-ended, unpredictable environments
Consistently avoiding hallucination and factual error

What AI Is Not Introductory

Public discourse around AI is saturated with misconceptions — drawn from science fiction, corporate marketing, and media sensationalism. Before going deeper into mechanisms, it is worth explicitly naming what AI is not.

🎬

Not Like the Movies

HAL 9000, Skynet, and Samantha from Her are not realistic portrayals. Real AI systems are narrow, brittle outside their training domain, and have no goals of their own — they optimise whatever objective they're given.

✨

Not Magic — It's Statistics

A language model predicts the most likely next token given all previous tokens. An image classifier outputs a probability distribution over labels. Every "intelligent" output is the result of mathematical optimisation on data.

🧬

Not Sentient or Conscious

Current AI systems have no subjective experience, emotions, or desires. They process inputs and produce outputs. Whether future AI could be conscious is a genuine philosophical question — today's systems are not.

🔮

Not Infallible

AI systems fail in unexpected ways. They hallucinate facts, exhibit bias inherited from training data, and can be fooled by adversarial examples. They are tools with specific, well-characterised failure modes — not oracles.

The goal of this documentation is to replace awe with understanding. Every impressive AI output is traceable to training data, an objective function, and an optimisation algorithm. The mystery evaporates when you see the mechanism.

Chapter 1.2 · Narrative History

A Brief History of AI — Origin to Present

AI has twice followed a pattern of hype → disillusionment → breakthrough. Knowing why the earlier approaches failed explains why deep learning succeeded where they did not.

AI history is a story of alternating euphoria and collapse. Understanding why each era gave way to the next — not just when — is the best inoculation against misreading hype today. The pattern: overpromise → underfund → winter → unexpected breakthrough → repeat.

AI Progress & Hype — 1950 to 2026

Pre-History (1800s–1943) Introductory

Long before the first computer was built, mathematicians and philosophers dreamed of mechanising thought. Gottfried Wilhelm Leibniz (1646–1716) envisioned a Characteristica Universalis — a universal symbolic language that could encode all human knowledge — and a calculus ratiocinator that could reason over it mechanically. This dream of reducing reasoning to symbol manipulation would resurface, unchanged in spirit, at the Dartmouth Conference three centuries later.

George Boole (1854) formalised logic as algebra, reducing true/false statements to 0s and 1s and defining the operations AND, OR, NOT. This was the mathematical bedrock everything else was built on. Ada Lovelace (1843), writing annotations on Babbage's Analytical Engine, described the first published algorithm intended for a computing machine and, crucially, articulated both its power and its limits: the machine can only do what it is told. Claude Shannon (1938) showed that Boolean algebra could be implemented in relay switching circuits, connecting Boole's logic to physical hardware. Finally, McCulloch and Pitts (1943) proposed the first mathematical model of a neuron: a binary threshold unit that fires when its weighted inputs exceed a threshold. Every modern neural network traces its ancestry to that 1943 paper.

These five contributions — symbolic reasoning, Boolean logic, algorithms, logic circuits, and artificial neurons — were the intellectual raw materials that the Dartmouth group assembled into a new field in 1956.

Pre-History Milestones — The Intellectual Foundations of AI

Birth of AI (1950–1956) In-depth

In 1950 Alan Turing published "Computing Machinery and Intelligence" — opening with the question "Can machines think?" He proposed the Imitation Game as a pragmatic operational test: if a machine can convince a human judge, communicating only via text, that it is human, we have sufficient practical grounds to attribute intelligence to it. Turing was careful not to define intelligence — he proposed the test precisely to sidestep that philosophical quagmire. He also described the "child machine" concept: rather than programming adult intelligence directly, build a machine that learns. This was the first clear articulation of what we now call machine learning.

Six years later, in the summer of 1956, John McCarthy, Marvin Minsky, Nathaniel Rochester, and Claude Shannon convened a workshop at Dartmouth College that officially founded the field of Artificial Intelligence; attendees included Herbert Simon and Allen Newell. Their proposal was breathtakingly optimistic: "Every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it." Minsky later claimed that "within a generation the problem of creating AI will substantially be solved." That generation passed without resolution. The Dartmouth group were brilliant researchers who massively underestimated three things: the importance of data, the role of perception and embodiment, and the sheer computational depth required.

The Turing Test (Imitation Game) — Setup & Outcomes

What Dartmouth Got Right

AI as a formal, dedicated research field
Symbolic manipulation is a valid form of reasoning
Computers can exhibit intelligent behaviour
The field needs dedicated researchers and funding

What Dartmouth Got Wrong

Intelligence timeline vastly underestimated
Ignored the critical importance of data
Ignored embodiment, perception, and grounding
Assumed logic alone equals intelligence

The Golden Age (1956–1974) Core

The years following Dartmouth were genuinely remarkable. Early AI programs demonstrated things no one had seen before: a machine proving mathematical theorems, a program that could hold a conversation, a system that learned to play checkers by playing against itself. That checkers program, written by Arthur Samuel, gave the field a new term in 1959: machine learning. The optimism was not irrational: the demos were real. The problem was that toy domains don't scale.

1957

Perceptron — Rosenblatt

First learning machine: a single-layer network that could learn to classify linearly separable inputs. Proved that machines could learn from examples, not just rules.

1959

"Machine Learning" coined — Arthur Samuel

Samuel's checkers program improved by self-play. He defined ML as "giving computers the ability to learn without being explicitly programmed." Still the best definition.

1965

DENDRAL — Stanford

First expert system with real scientific impact: identified organic molecules from mass spectrometry. Proved narrow domain AI could produce genuine scientific value.

1966

ELIZA — Weizenbaum

First chatbot, simulating a Rogerian therapist using pattern matching. Users formed genuine emotional connections — a warning about anthropomorphism that applies directly to LLMs today.

1969

Minsky & Papert — "Perceptrons"

Showed mathematically that a single-layer perceptron cannot represent functions that are not linearly separable, such as XOR. Effectively killed neural network research funding for a decade. The field pivoted almost entirely to symbolic AI.

1970

SHRDLU — Winograd

NLP system that understood commands in a simulated blocks world. Breathtakingly impressive in its domain. Completely brittle outside it — the "toy world" problem made concrete.
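The perceptron's strength (1957) and the limitation Minsky and Papert highlighted (1969) can both be reproduced in a few lines. This is a minimal sketch of a single threshold unit trained with the classic perceptron update rule; the epoch count, learning rate, and function names are illustrative assumptions.

```python
def predict(params, x1, x2):
    """Single threshold unit: fire (1) when the weighted sum exceeds zero."""
    w1, w2, bias = params
    return 1 if (w1 * x1 + w2 * x2 + bias) > 0 else 0


def train_perceptron(samples, epochs=25, lr=1.0):
    """Perceptron rule: nudge the weights toward each misclassified example."""
    w1 = w2 = bias = 0.0
    for _ in range(epochs):
        for (x1, x2), target in samples:
            error = target - predict((w1, w2, bias), x1, x2)
            w1 += lr * error * x1
            w2 += lr * error * x2
            bias += lr * error
    return w1, w2, bias


def accuracy(samples, params):
    """Fraction of samples the trained unit classifies correctly."""
    hits = sum(1 for (x1, x2), t in samples if predict(params, x1, x2) == t)
    return hits / len(samples)


AND = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

print("AND accuracy:", accuracy(AND, train_perceptron(AND)))
print("XOR accuracy:", accuracy(XOR, train_perceptron(XOR)))
```

AND is linearly separable, so the unit learns it perfectly. No single straight line separates the XOR cases, so no setting of the three parameters classifies all four inputs correctly, however long training runs. Adding a hidden layer removes the limitation, but practical training of multi-layer networks had to wait for backpropagation.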

The Two AI Winters (1974–1993) In-depth

The AI winters are the most important episodes in AI history for developing intuition about the current moment. They were not caused by bad science. They were caused by a structural gap: researchers promised capabilities that required data and compute that didn't yet exist, and funders cut support when those promises weren't kept on schedule.

AI Hype Cycles — Two winters shaped modern research culture

❄️

Why AI Failed (Technical)

Combinatorial explosion: symbolic search cannot scale to real-world problem sizes
No data: rule-based systems require hand-crafted knowledge for every fact
No compute: networks were too slow to train at any useful scale
Brittle: systems collapse immediately outside their narrow training domain

⚠️

Why AI Failed (Institutional)

Overpromised: researchers claimed 5–10 year timelines to funders, repeatedly
Underfunded follow-through: funding cut the moment hype peaked
Misaligned incentives: demos optimised for impressiveness, not robustness
The AI Effect: as each thing worked, it stopped being called AI

Winter	Trigger	What Collapsed	Root Cause	What Survived
First (1974–80)	Lighthill Report (UK, 1973); DARPA cuts	General-purpose AI, machine translation, symbolic reasoning	Combinatorial explosion; overpromised timelines to funders	Specialised systems; early expert system research; Prolog
Second (1987–93)	Expert system maintenance failures; Lisp machine market collapse	Commercial expert systems; DARPA Strategic Computing Program	Expert systems too brittle and expensive to maintain at real-world scale	Backpropagation (1986 rediscovery); statistical ML; RL foundations

Every AI winter was caused by the same three gaps: not enough DATA, not enough COMPUTE, and not enough ALGORITHMIC insight. By 2012, all three gaps had been filled simultaneously for the first time in history.

The ML Revival (1993–2012) Core

While AI winters suppressed funding, a quieter revolution was accumulating. Statistical machine learning — SVMs, random forests, gradient boosting — replaced brittle symbolic systems with principled, mathematically-grounded methods. Meanwhile, the web was producing data at a scale no previous generation could have imagined, and GPU hardware was becoming cheap enough to train meaningfully large networks.

1997

Deep Blue beats Kasparov

AI defeats the world chess champion for the first time. Minimax search + hand-crafted evaluation function. Proved AI could match humans in a complex domain — but through brute-force search, not generalisation.

1998

LeNet — LeCun

Convolutional neural networks applied to handwritten digit recognition (MNIST). Showed CNNs work in practice — but GPUs weren't available yet, limiting scale.

2001

Random Forests — Breiman

Powerful ensemble method combining many decision trees. Became the dominant practical ML algorithm for structured data — and remains so in many domains today.

2006

Deep Belief Networks — Hinton

Hinton showed that deep networks could be trained using layer-wise pre-training. Revived interest in deep neural networks after 15 years of dormancy.

2009

ImageNet — Fei-Fei Li

More than 14 million labelled images in its full form; the ILSVRC competition subset has about 1.2 million training images across 1,000 categories. The dataset that would change everything: the fuel that AlexNet needed to ignite the deep learning revolution three years later.

2011

IBM Watson wins Jeopardy!

NLP + knowledge retrieval at scale. Beat human champions Ken Jennings and Brad Rutter. Demonstrated that AI could handle open-ended natural language questions — a mainstream moment for AI.

2011

GPUs proven for deep learning — Ciresan et al.

One of the first papers showing GPU-trained deep networks dramatically outperforming CPU-trained ones. The compute bottleneck was about to break.

2012

The Three Enablers Converge

ImageNet (data) + NVIDIA CUDA (compute) + ReLU and Dropout (algorithms) converge for the first time in AlexNet. The conditions for a revolution are now met.

🗄️

Data

Web-scale datasets: ImageNet (1.2M images), Common Crawl (trillions of words), user-generated content from social platforms. Training data grew 1000× in a decade — providing the signal deep networks needed to learn.

⚡

Compute

NVIDIA GPUs and CUDA (2007) gave 100–1000× speedup for matrix operations via massive parallelism. What took weeks on CPUs took hours on GPUs. Training deep networks became economically viable for the first time.

🧪

Algorithms

ReLU activation (mitigated vanishing gradients) and Dropout regularisation (reduced overfitting) were in place by 2012; Batch Normalisation (stabilised training) and residual connections (enabled very deep networks) followed in 2015. Each independently helpful; together transformative.

The Deep Learning Era (2012–2022) In-depth

AlexNet's ImageNet victory in 2012 was the moment the field changed permanently. Krizhevsky, Sutskever, and Hinton's network achieved 15.3% top-5 error vs. the runner-up's 26.2% — an 11-point gap that shocked the computer vision community. The model was trained on two consumer GPUs in five days. Within 18 months, every major research lab had pivoted entirely to deep neural networks. The pattern: one shocking result, then near-universal adoption.

Year	Event	Why It Matters
★ 2012	AlexNet — Hinton, Krizhevsky, Sutskever	Wins ImageNet by a 10.9-point margin (15.3% vs 26.2% top-5 error). GPU-trained CNN. Every major lab pivots to deep learning within 18 months. The modern AI era begins here.
2013	Word2Vec — Mikolov (Google)	Word embeddings: words as dense vectors. "king − man + woman ≈ queen." Language has geometric structure. Foundation of all modern NLP.
2014	GANs — Goodfellow et al.	Generative Adversarial Networks: generator vs discriminator. Early samples were small and blurry, but later GAN variants produced photorealistic images. The generative AI era begins conceptually.
2015	ResNet — He et al. (Microsoft)	152-layer network wins ImageNet. Residual skip connections eased the optimisation problems that had blocked very deep networks, making networks with hundreds of layers trainable.
★ 2016	AlphaGo — DeepMind	Beats Lee Sedol 4-1 at Go — more positions than atoms in the observable universe. MCTS + deep RL. Arrived 10–20 years ahead of expert predictions.
★ 2017	"Attention Is All You Need" — Vaswani et al.	The Transformer architecture. Self-attention replaces RNNs entirely. Processes all tokens in parallel. Ancestor of GPT, BERT, DALL-E, Whisper, AlphaFold — every major AI system built since 2018.
2018	BERT & GPT-1 — Google & OpenAI	Pre-train on unlabelled text → fine-tune on tasks. NLP benchmark records shattered across the board. The pre-train → fine-tune paradigm established.
★ 2020	GPT-3 — 175B parameters	Few-shot learning at scale: give a few examples in the prompt and the model solves the task without fine-tuning. Foundation model era begins. The "too dangerous to release" label is often attached to it, but that controversy concerned GPT-2 in 2019; GPT-3 was offered through an API.
2021	DALL-E, CLIP, AlphaFold 2	Multimodal AI: images and text in the same embedding space. AlphaFold 2 solves protein structure prediction — a 50-year open problem in biology — in a single paper.

The Transformer deserves a dedicated note. "Attention Is All You Need" (Vaswani et al., 2017) replaced recurrent networks with a single elegant mechanism: self-attention, which allows every token to attend to every other token in parallel. This unlocked two things simultaneously — full context access (no sequential bottleneck) and massive parallelism (scale with compute). The result: BERT, GPT, T5, LLaMA, Whisper, DALL-E, AlphaFold, and virtually every transformative AI system since 2018 is built on Transformers. It is arguably the most consequential single paper in AI history.
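The core computation, scaled dot-product attention, is small enough to write out by hand. The sketch below is a simplification under stated assumptions: a single attention head, no masking, and no learned projection matrices (a real Transformer derives queries, keys, and values from the token embeddings through learned linear layers). The three two-dimensional "token" vectors are made up for illustration.

```python
import math


def softmax(row):
    """Normalise a list of scores into weights that sum to 1."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(queries, keys, values):
    """softmax(Q K^T / sqrt(d)) V, computed row by row."""
    d = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this token's query to every token's key.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        mixed = [sum(w * v[j] for w, v in zip(weights, values))
                 for j in range(len(values[0]))]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights


# Toy embeddings; here the same vectors serve as queries, keys, and values.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(tokens, tokens, tokens)
for row in weights:
    print([round(w, 3) for w in row])
```

Each row of `weights` depends only on that token's query and the shared keys, so all rows can be computed independently. That independence is what lets the real implementation run as a handful of large matrix multiplications on a GPU.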

The LLM Era (2022–Present) Core

ChatGPT launched on November 30, 2022. It reached 1 million users in 5 days and an estimated 100 million users in about two months, the fastest consumer application adoption recorded up to that point. Instagram took roughly 2.5 years to reach the same milestone. Twitter took 5 years. This wasn't a product launch; it was a cultural event. For the first time, a general-purpose AI system was accessible to anyone with a browser, and it worked well enough to be genuinely useful for everyday tasks.

When	Event	Why It Matters
★ Nov 2022	ChatGPT — OpenAI	100M users in about 60 days, the fastest consumer adoption recorded at the time. RLHF alignment made LLMs genuinely helpful. AI moved permanently into mainstream public discourse.
★ Mar 2023	GPT-4 — OpenAI	Multimodal (text + images). Near-human on bar exam, medical licensing, and professional benchmarks. Set the capability standard that triggered the multi-model race.
Mar–Dec 2023	Claude (Anthropic) & Gemini (Google)	Multi-model competitive landscape emerges: Claude launches in March 2023 and Gemini in December 2023. Constitutional AI alignment (Anthropic) and multimodal training at Google scale. No single company dominates; the race is on.
★ Jul 2023	Llama 2 — Meta (open weights)	Open-weight LLMs released under a licence that permits most commercial use. Anyone with a suitable GPU can run, fine-tune, or modify capable models. Democratised AI development and sparked the open-source ecosystem.
★ 2024–2026	Reasoning Models & Agentic AI	o1, o3, DeepSeek-R1: models that think before answering. Claude 3.7, Gemini 2: long-context, multimodal, agentic. Computer-use agents. Physical AI (humanoid robots + LLMs). Current frontier.

The AI Progress Flywheel Core

Unlike the boom-bust cycles of the past, the current AI wave is reinforced by a self-amplifying economic flywheel. Better models attract more users. More users generate more revenue and data. More revenue funds more compute. More compute trains better models. The cycle has no obvious brake — which is precisely why the pace of improvement has been relentlessly accelerating since 2012, and why projection-based estimates consistently underestimate where we will be in three years.

The AI Progress Flywheel — Why the pace keeps accelerating

The flywheel has been spinning faster every year since 2012. Training compute for frontier models has grown roughly 4× per year. GPT-4's training run cost an estimated $100M. Next-generation frontier models are projected at $1B+. This isn't reckless spending: the returns on better models are large enough that the economics reinforce continued investment. Unlike during the AI winters, it is hard to identify an external shock that would stop this cycle; the likeliest brake is the discovery of fundamental capability limits, which has not yet materialised.

📋 Chapter 1.2 — Key Takeaways

Two AI winters caused by the same three gaps: insufficient data, compute, and algorithms — not bad science
AlexNet (2012) and Transformer (2017) are the two true inflection points of modern AI
ChatGPT (Nov 2022) — 100M users in 60 days — made AI mainstream overnight; Instagram took 2.5 years
Expert systems failed because knowledge cannot be fully encoded as rules — ML learns it from data instead
The AI flywheel: better models → more users → more revenue and data → more compute → better models
Understanding AI winters is essential for evaluating today's hype with appropriate skepticism and calibration
2024–2026: reasoning models + agentic AI = the current research and product frontier

Chapter 1.3 · Knowledge & Reasoning

Symbolic AI & Knowledge Representation

Symbolic AI is commonly omitted from modern documentation. This is a mistake. Understanding why symbolic AI dominated for 30 years — and exactly why it failed to scale — is what explains why neural networks won. More importantly, it reveals what neural networks still cannot do.

GOFAI: Good Old-Fashioned AI Core

The philosopher John Haugeland coined the term GOFAI (Good Old-Fashioned AI) in 1985 to describe the dominant paradigm that had governed AI research since Dartmouth. Its core assumption: "Intelligence is symbol manipulation according to rules." Feed a machine symbols representing the world, give it logical rules for manipulating those symbols, and intelligence will emerge. The physical symbol system hypothesis (Newell & Simon, 1976) formalised this: "A physical symbol system has the necessary and sufficient means for general intelligent action." This claim was bold, wrong in the absolute sense, and enormously productive for 30 years.

Two schools developed within GOFAI. The logic-based school (McCarthy, Newell, Simon) grounded AI in formal logic and theorem proving: intelligence as deduction. The knowledge-based school (Minsky, Feigenbaum) focused on representing domain knowledge richly enough that a system could reason over it: intelligence as structured lookup and inference. Both shared the same fatal assumption: that the hard part of intelligence is the reasoning engine, not the knowledge itself.

Symbolic AI

Explicit rules written by human experts
Fully interpretable — you can read the logic
Fails hard on edge cases rules don't cover
No data needed — rules are manual
Dominated 1956–1986

Connectionist AI (Neural Networks)

Patterns learned directly from data
Black box — hard to interpret weights
Robust to variation and noise
Needs large labelled datasets
Dominated 1986–present

Symbolic AI vs. Neural AI — How Knowledge Is Represented

Knowledge Representation In-depth

Before an AI system can reason, it must represent knowledge. GOFAI researchers developed four major representational frameworks — each with a distinct philosophy about what knowledge is and how it should be stored.

🕸️

Semantic Networks

Nodes = concepts, edges = typed relationships. "Dog → IS-A → Mammal → IS-A → Animal" and "Mammal → HAS → Fur". Inheritance flows through IS-A links, so Dog automatically inherits all Mammal properties, including Fur. Precursor to modern knowledge graphs (Wikidata, Google Knowledge Graph, RAG retrieval indexes).

📋

Frames (Minsky, 1974)

Structured objects with named slots and default values, essentially a class/struct for knowledge. A "Car" frame has slots: color (default: red), wheels (default: 4), engine: present. Frames support inheritance and overriding, and closely parallel the classes of object-oriented programming.

🏗️

Ontologies

Formal specification of all concepts and relationships within a domain. Medical: SNOMED CT (350,000+ concepts). Web: OWL, RDF standards. The Cyc project (Lenat, 1984–present): 25M+ rules encoding common-sense knowledge over 40 years. Modern AI: ontologies power knowledge graphs in RAG pipelines.

⚙️

Production Rules

IF [condition] THEN [action] — the basis of all expert systems. Forward chaining: data-driven, fire rules that match current known facts, derive new facts until goal reached. Backward chaining: goal-driven, work backward from desired conclusion to find what must be true. Prolog uses backward chaining natively.

Semantic Network — Concepts connected by typed relationships
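A semantic network is, at bottom, a labelled graph plus a rule for walking IS-A links. The sketch below assumes a simple adjacency-list representation; the class name `SemanticNetwork`, its methods, and the extra "Animal CAN Move" edge are illustrative, not drawn from any historical system.

```python
class SemanticNetwork:
    """Concepts as nodes, typed relationships as labelled edges."""

    def __init__(self):
        self.edges = {}  # node -> list of (relation, target)

    def add(self, source, relation, target):
        self.edges.setdefault(source, []).append((relation, target))

    def ancestors(self, node):
        """Follow IS-A links upward, nearest ancestor first."""
        seen = []
        frontier = [node]
        while frontier:
            current = frontier.pop(0)
            for relation, target in self.edges.get(current, []):
                if relation == "IS-A" and target not in seen:
                    seen.append(target)
                    frontier.append(target)
        return seen

    def properties(self, node, relation):
        """Targets of `relation` on the node itself and on everything it inherits from."""
        found = []
        for concept in [node] + self.ancestors(node):
            for rel, target in self.edges.get(concept, []):
                if rel == relation and target not in found:
                    found.append(target)
        return found


net = SemanticNetwork()
net.add("Dog", "IS-A", "Mammal")
net.add("Mammal", "IS-A", "Animal")
net.add("Mammal", "HAS", "Fur")
net.add("Animal", "CAN", "Move")  # illustrative extra fact

print(net.ancestors("Dog"))
print(net.properties("Dog", "HAS"))
```

"Dog HAS Fur" is never stored; it is derived by walking the IS-A chain at query time. That is the appeal of the representation, and also its cost: every exception (a hairless breed, say) needs explicit handling.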

 // Frame representation: structured knowledge with slots and defaults
 FRAME: Car
   slots:
     color:     default=red,   type=string
     wheels:    default=4,     type=integer
     engine:    default=true,  type=boolean   // true = engine present
     owner:     default=nil,   type=Person-Frame
     top_speed: default=180,   type=integer   // km/h (illustrative value)
 
 FRAME: SportsCar (inherits: Car)
   slots:
     turbo:     default=true,  type=boolean   // new slot, not defined in Car
   overrides:
     top_speed: default=250                   // km/h, overrides Car default
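Slot lookup with inheritance and defaults takes only a few lines to emulate. This is a minimal sketch, assuming a frame is a named bag of slots with an optional parent; the `Frame` class and the `my_car` instance are hypothetical, with slot values borrowed from the Car example.

```python
class Frame:
    """A named set of slots that falls back to its parent frame for missing values."""

    def __init__(self, name, parent=None, **slots):
        self.name = name
        self.parent = parent
        self.slots = dict(slots)

    def get(self, slot):
        # Walk up the inheritance chain until some frame defines the slot.
        frame = self
        while frame is not None:
            if slot in frame.slots:
                return frame.slots[slot]
            frame = frame.parent
        raise KeyError(f"{self.name} has no slot {slot!r}")


car = Frame("Car", color="red", wheels=4, engine=True, owner=None)
sports_car = Frame("SportsCar", parent=car, turbo=True, top_speed=250)
my_car = Frame("MyCar", parent=sports_car, color="blue")  # overrides the default colour

print(my_car.get("color"), my_car.get("wheels"), my_car.get("top_speed"))
```

`my_car` stores only what is unusual about it; everything else is filled in from defaults further up the chain. This default-filling behaviour is what made frames attractive for representing stereotyped situations.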

Expert Systems Core

Expert systems were the first commercially successful AI technology. The idea: interview a domain expert, encode their knowledge as production rules, build an inference engine to fire those rules. In the late 1970s and throughout the 1980s, this actually worked — inside carefully bounded domains with well-defined rules.

 // MYCIN-style production rule — bacterial infection diagnosis
 // Rule 52
 IF organism.staining_reaction = "gram-negative"
 AND organism.morphology       = "rod"
 AND patient.compromised_host  = true
 THEN organism.identity = "Pseudomonas" (confidence: 0.6) 
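The two inference strategies can be sketched over rules of this shape. The code below is a simplification under stated assumptions: facts are plain strings, each rule is a (premises, conclusion) pair, and certainty factors such as the 0.6 above are ignored. The second rule is a hypothetical follow-on added only so that chaining has something to chain to.

```python
def forward_chain(facts, rules):
    """Data-driven: fire every rule whose premises are known until nothing new appears."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if conclusion not in known and premises <= known:
                known.add(conclusion)
                changed = True
    return known


def backward_chain(goal, facts, rules, _seen=None):
    """Goal-driven: a goal holds if it is a known fact or some rule for it has provable premises."""
    if goal in facts:
        return True
    seen = _seen or set()
    if goal in seen:  # guard against circular rules
        return False
    seen = seen | {goal}
    return any(
        all(backward_chain(p, facts, rules, seen) for p in premises)
        for premises, conclusion in rules
        if conclusion == goal
    )


rules = [
    (frozenset({"stain=gram-negative", "morphology=rod", "host=compromised"}),
     "identity=pseudomonas"),
    # Hypothetical second rule, for illustration only.
    (frozenset({"identity=pseudomonas"}), "flag=review-by-clinician"),
]
facts = {"stain=gram-negative", "morphology=rod", "host=compromised"}

print(sorted(forward_chain(facts, rules) - facts))
print(backward_chain("flag=review-by-clinician", facts, rules))
```

Forward chaining derives everything it can from the data. Backward chaining explores only the rules relevant to the question asked, which suits a diagnostic consultation; MYCIN itself reasoned backward from candidate conclusions.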

System	Domain	Year	Rules	Key Achievement
DENDRAL	Chemistry	1965	500+	First automated scientific reasoning — identified organic molecules from mass spectrometry
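MYCIN	Medical diagnosis	1972	600	Therapy recommendations rated acceptable in about 65% of cases by blinded evaluators, on par with or above Stanford infectious-disease faculty (42.5–62.5%). Never deployed clinically, partly because of liability concerns.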
XCON / R1	Computer configuration	1980	2,500	Saved DEC $40M/year configuring VAX systems. Proved AI had real commercial ROI.
PROSPECTOR	Mineral exploration	1978	1,000	Discovered a $100M molybdenum deposit. Geological knowledge + probabilistic inference.
Cyc	Common-sense reasoning	1984	1M+	Still maintained. World's largest manually encoded knowledge base. Represents the limit of the approach.

✅

What Expert Systems Got Right

Formal representation of domain knowledge works within bounds
Explainable by design — every decision traceable to a rule
Can capture rare cases that training data won't cover
Still used in medical, legal, and financial rule engines today

❌

Why Expert Systems Failed

Knowledge bottleneck: experts can't articulate their tacit knowledge as rules
Brittleness: catastrophic failure on any case the rules don't cover
Maintenance nightmare: every edge case needs a manual rule update
Can't learn: no mechanism to improve from new data or outcomes

Logic & Formal Reasoning In-depth

Logic is the mathematics of valid inference. GOFAI researchers hoped to give AI the ability to reason with the same rigor as formal proof — deriving new knowledge from existing knowledge with absolute certainty. Three levels of expressive power matter for AI.

🔣

Propositional Logic

True/false propositions with connectives: AND (∧), OR (∨), NOT (¬), IMPLIES (→). Fully decidable. Too limited: can't express properties of objects or quantify over them.

P = "It rains"   Q = "Ground is wet"

P → Q    // if it rains, ground is wet

P        // it rains (given)

∴ Q     // modus ponens: ground is wet

∀

First-Order Logic (FOL)

Adds variables, predicates, and quantifiers. Used in theorem provers, Prolog, OWL ontologies. Semi-decidable — powerful but computationally expensive.

∀x: Human(x) → Mortal(x)  // all humans are mortal
 Human(Socrates)            // Socrates is human

∴ Mortal(Socrates)         // universal instantiation, then modus ponens

🔀

Non-Monotonic Reasoning

Classical logic: once proved, always true. Real world: "Birds fly... unless they're penguins." Non-monotonic reasoning handles exceptions and defaults — conclusions can be retracted when new information arrives. Essential for commonsense AI.

 /* First-order logic in Prolog: queries are answered by backward chaining */
 human(socrates). human(plato).
 mortal(X) :- human(X). /* ∀X: human(X) → mortal(X) */
 
 /* ?- mortal(socrates).  → true  */
 /* ?- mortal(zeus).      → false (not in KB) */
 
 penguin(opus).
 bird(tweety).
 bird(X) :- penguin(X). /* every penguin is a bird */
 flies(X) :- bird(X), \+ penguin(X). /* birds fly by default, unless known to be penguins */
 
 /* ?- flies(tweety).  → true  */
 /* ?- flies(opus).    → false: the default conclusion is withdrawn (non-monotonic) */

Inference Strategies — Forward Chaining vs Backward Chaining

Search Algorithms In-depth

If knowledge representation is how GOFAI stores the world, search is how it reasons through it. AI as search through a state space was the dominant problem-solving paradigm from 1956–1986. Define a state (current configuration), a set of actions (legal transitions), and a goal (desired configuration). The AI finds a sequence of actions from start to goal. Simple in principle — catastrophically hard in practice as state spaces grow.

🌊

Uninformed Search

BFS: explore layer by layer — finds shortest path, memory expensive O(b^d)
DFS: explore deep first, memory efficient O(bm) for branching factor b and maximum depth m, may not find shortest path
Iterative Deepening DFS: optimal like BFS, memory like DFS — best of both worlds
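The three uninformed strategies are easiest to compare side by side on a toy graph. This is a sketch under simple assumptions: the graph `GRAPH` is made up for illustration, edges all cost 1, and paths are returned as lists of node names.

```python
from collections import deque

GRAPH = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D", "E"],
    "D": ["F"],
    "E": ["F"],
    "F": [],
}


def bfs(graph, start, goal):
    """Breadth-first search: returns a path with the fewest edges, or None."""
    frontier = deque([[start]])
    visited = {start}
    while frontier:
        path = frontier.popleft()
        node = path[-1]
        if node == goal:
            return path
        for nxt in graph[node]:
            if nxt not in visited:
                visited.add(nxt)
                frontier.append(path + [nxt])
    return None


def depth_limited(graph, path, goal, limit):
    """Depth-first search that refuses to go deeper than `limit` edges."""
    node = path[-1]
    if node == goal:
        return path
    if limit == 0:
        return None
    for nxt in graph[node]:
        if nxt not in path:  # avoid cycles along the current path
            found = depth_limited(graph, path + [nxt], goal, limit - 1)
            if found:
                return found
    return None


def iddfs(graph, start, goal, max_depth=20):
    """Iterative deepening: repeat depth-limited DFS with a growing limit."""
    for limit in range(max_depth + 1):
        found = depth_limited(graph, [start], goal, limit)
        if found:
            return found
    return None


print(bfs(GRAPH, "A", "F"))
print(iddfs(GRAPH, "A", "F"))
```

BFS keeps a whole layer of partial paths in memory at once, while iterative deepening only ever holds the current path and pays for that by re-expanding shallow nodes on every pass.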

🎯

Informed (Heuristic) Search

A*: f(n) = g(n) + h(n) — actual cost + heuristic estimate. Optimal if h is admissible (never overestimates actual cost)
Greedy best-first: h(n) only — fast but not optimal
Used in: pathfinding, GPS navigation, game AI, robot motion planning
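To make f(n) = g(n) + h(n) concrete, here is a minimal A* sketch on a small grid. The grid layout, the `a_star` function name, and the choice of Manhattan distance as the heuristic are assumptions for illustration; Manhattan distance is admissible here because every move costs 1 and diagonal moves are not allowed.

```python
import heapq


def a_star(grid, start, goal):
    """A* on a 4-connected grid; cells containing 1 are walls.

    Returns the cost of a cheapest path, or None if the goal is unreachable.
    """
    def h(cell):
        # Manhattan distance never overestimates on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    best_g = {start: 0}
    while frontier:
        f, g, cell = heapq.heappop(frontier)
        if cell == goal:
            return g
        if g > best_g[cell]:
            continue  # stale queue entry, a cheaper route was found later
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < len(grid) and 0 <= nc < len(grid[0]) and grid[nr][nc] == 0:
                ng = g + 1
                if ng < best_g.get((nr, nc), float("inf")):
                    best_g[(nr, nc)] = ng
                    heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
    return None


GRID = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
]
print(a_star(GRID, (0, 0), (3, 3)))
```

Setting h to always return 0 turns this into uniform-cost search; sorting the queue on h alone turns it into greedy best-first search, which is faster but loses the optimality guarantee.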

BFS vs A* — Why heuristics matter

Algorithm	Complete?	Optimal?	Time	Space	Best For
BFS	✓ Yes	✓ Yes (unweighted)	O(b^d)	O(b^d)	Shortest path, small state spaces
DFS	✗ No (can loop or descend forever without a depth limit)	✗ No	O(b^m)	O(bm)	Memory-constrained, finding any path
A*	✓ Yes	✓ Yes (admissible h)	O(b^d)	O(b^d)	Pathfinding with good heuristic
Minimax	✓ Yes	✓ Yes	O(b^m)	O(bm)	2-player zero-sum games
Alpha-Beta	✓ Yes	✓ Yes	O(b^(m/2)) best case	O(bm)	Games: with good move ordering, searches roughly twice as deep as minimax in the same time

A* in practice: g(n) = actual cost from start. h(n) = heuristic estimate to goal (e.g., straight-line distance for routing). f(n) = g(n) + h(n). Admissibility condition: h(n) must never overestimate actual cost. If admissible, A* is guaranteed optimal. Minimax & Alpha-Beta: Deep Blue evaluated 200M chess positions per second using Alpha-Beta pruning to defeat Kasparov in 1997 — search without learning. Modern engines (Stockfish) combine Alpha-Beta with learned evaluation functions.
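Alpha-Beta pruning returns exactly the minimax value while skipping branches that cannot change the decision. The sketch below assumes a game tree given as nested Python lists with numeric leaves; the tree values and the `stats` counter are illustrative additions, not taken from any chess engine.

```python
def alphabeta(node, maximizing, alpha=float("-inf"), beta=float("inf"), stats=None):
    """Minimax value of a game tree given as nested lists; leaves are numbers."""
    if not isinstance(node, list):
        if stats is not None:
            stats["leaves"] += 1  # count how many leaves were actually evaluated
        return node
    if maximizing:
        value = float("-inf")
        for child in node:
            value = max(value, alphabeta(child, False, alpha, beta, stats))
            alpha = max(alpha, value)
            if alpha >= beta:
                break  # the minimiser above will never allow this branch
        return value
    value = float("inf")
    for child in node:
        value = min(value, alphabeta(child, True, alpha, beta, stats))
        beta = min(beta, value)
        if alpha >= beta:
            break  # the maximiser above already has something better
    return value


# Root is a MAX node with three MIN children, each with three leaves.
TREE = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
stats = {"leaves": 0}
best = alphabeta(TREE, True, stats=stats)
print(best, stats["leaves"])
```

On this tree the second MIN node is abandoned after its first leaf, because a value of 2 can never beat the 3 already guaranteed by the first branch. How much gets pruned depends heavily on the order in which moves are examined.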

Why Symbolic AI Failed Core

GOFAI wasn't bad science — it produced real results in bounded domains. It failed because three fundamental problems proved impossible to solve within its paradigm. These aren't engineering problems. They are conceptual limits on what rule-based symbol manipulation can express.

🖼️

The Frame Problem

McCarthy & Hayes (1969): when an action occurs, what changes and what stays the same? "I move a cup: does the coffee stay in it? Does the table change? Does gravity still apply?" Humans know intuitively what is relevant. Spelling out everything that does not change, for every action, is impractical in a finite rule set.

🔒

Knowledge Bottleneck

Expert knowledge is largely tacit: "know-how", not "know-that". Doctors, chess players, and engineers can't fully articulate the pattern recognition driving their decisions. This tacit knowledge cannot be extracted through interviews; it is exactly what neural networks later learned directly from examples.

💥

Brittleness & Explosion

Symbolic systems fail immediately outside their defined domain, with no graceful degradation. Real-world ambiguity can't be captured by finite rules. Rules also interact: 1,000 rules have roughly 500,000 possible pairwise interactions, and higher-order combinations run into the millions. The combinatorial explosion of exceptions makes complete rule coverage impractical.

The lesson of GOFAI: most of what we call intelligence is not stored in explicit rules. It emerges from exposure to experience. This insight underpins machine learning. Neural networks do not solve the frame problem so much as sidestep it: they learn from data what is relevant instead of having it enumerated by hand.

📋 Chapter 1.3 — Key Takeaways

GOFAI assumption: intelligence = symbol manipulation by rules — worked in narrow domains, failed at scale
Knowledge representation: semantic networks, frames, ontologies, production rules — all have modern descendants
Expert systems delivered real value (XCON commercially, MYCIN in evaluation) but hit the knowledge acquisition bottleneck
FOL & Prolog: powerful for formal reasoning, but FOL is only semi-decidable and inference is computationally expensive at scale
A* search: optimal heuristic pathfinding — still used in robotics, navigation, and game AI today
Symbolic AI's limits suggested that much of intelligence must be learned from experience, not encoded as rules

1.4

Chapter 1.4 · Philosophy of Mind

The Turing Test & Philosophy of Mind

Turing’s 1950 paper opens with “I propose to consider the question, Can machines think?” — then immediately sidesteps it. The question matters not because we can answer it, but because our answer determines what we build and how we align it. These are not idle philosophical puzzles.

The Turing Test In-depth

Alan Turing’s 1950 paper “Computing Machinery and Intelligence” is the founding document of AI philosophy. He proposed the Imitation Game as a pragmatic replacement for the unanswerable “Can machines think?” A human interrogator communicates via text with two participants — one human, one machine. If the interrogator cannot reliably distinguish which is which, the machine has passed. Turing predicted that by 2000, a machine would fool 30% of judges after 5 minutes. He was roughly right about the timeline, but the test itself proved easier to pass than it was to mean anything.

The Turing Test — Alan Turing, 1950 · “Computing Machinery and Intelligence”

🎭

Critique 1: Behaviour ≠ Intelligence

A system could pass using clever tricks without understanding anything. ELIZA (1966) fooled users into thinking they were talking to a therapist using simple pattern matching. GPT-4 arguably passes the test today — does that mean it “thinks”?

🦇

Critique 2: Wrong Benchmark

A bat navigates in darkness via echolocation — we don't test AI on that. Why is human-style conversation the benchmark for ALL intelligence? AlphaGo is superhuman at Go but would fail the Turing Test. The test is anthropocentric by design.

🀄

Critique 3: The Chinese Room

John Searle argues a system could pass the test without understanding anything. This is the most famous critique of the Turing Test; the full argument is covered in The Chinese Room section below.

🏆

Has Anything Passed It?

Eugene Goostman (2014) claimed to pass — controversy ensued (judges were lenient). Modern LLMs like GPT-4 routinely fool humans in short conversations. The test itself may now be obsolete as a meaningful benchmark.

Test / Benchmark	What It Measures	Year	Current Status
Turing Test	Language imitation — can machine fool a human?	1950	Easily gamed by modern LLMs — not a meaningful bar
Winograd Schema	Common-sense reasoning via pronoun resolution	2011	LLMs solved it by 2022 — retired as a benchmark
ARC Challenge	Novel visual pattern reasoning	2019	Plain LLMs such as GPT-4 score far below the ~85% human-level target; remains one of the harder benchmarks
MMLU	Knowledge across 57 academic subjects	2020	GPT-4 ~87%, human expert ~89% — essentially matched
BIG-Bench Hard	23 hard multi-step reasoning tasks	2022	Frontier reasoning models approaching human performance

The Chinese Room In-depth

In 1980 philosopher John Searle published “Minds, Brains, and Programs” — arguably the most influential and most debated paper in AI philosophy. His thought experiment: you are locked in a room. Through a slot, people pass slips of paper with Chinese characters. You have a rulebook: “When you see symbol sequence X followed by Y, write Z and pass it back.” You follow the rules perfectly. From outside, your responses are indistinguishable from those of a fluent Chinese speaker. But you understand nothing. You are just manipulating symbols according to formal rules.

Searle’s conclusion: syntax is not sufficient for semantics. A program that processes symbols according to rules — no matter how sophisticated — does not thereby understand those symbols. It has the form of language processing without the content. By analogy: AI programs process language according to learned statistical rules. They produce outputs indistinguishable from understanding. But this doesn’t mean they understand.

Searle's Chinese Room — Syntax Without Semantics

🏛️

Systems Reply

It’s not the person who understands — it’s the whole system (person + rules + room). Similarly, a single neuron doesn’t understand, but the brain does. You must evaluate the system, not its components in isolation.

🤖

Robot Reply

Put the room in a robot with sensors and actuators. Now symbols are connected to real-world referents — “fire” is associated with heat sensors. Grounding in the physical world might be what produces genuine understanding.

🧠

Brain Simulator Reply

What if the rulebook simulated every neuron in a Chinese speaker’s brain exactly? Would that produce understanding? If Searle answers no, he owes an account of what biological neurons contribute that an exact simulation of them lacks; if yes, the Chinese Room argument fails.

🔄

Searle’s Counter-Reply

All these replies move the lack of understanding around without eliminating it. The systems that result still only process symbols formally — there is no genuine intentionality. Intentionality requires biology, Searle argues.

The Chinese Room is not a solved argument. GPT-4 is, in a very real sense, an enormously complex Chinese Room. Whether “understanding” requires something beyond symbol manipulation remains genuinely open — and the answer determines how seriously we should take AI wellbeing and alignment.

Consciousness & Machine Sentience Core

Philosopher David Chalmers (1995) distinguished easy problems of consciousness — explaining cognitive functions like attention, memory, and behaviour — from the Hard Problem: why does all this processing feel like something from the inside? Why isn’t cognition just computation happening “in the dark”, without any inner experience? This question may be permanently beyond empirical investigation, because any physical description of a brain state leaves open why there is subjective experience associated with it.

🌌

The Hard Problem

Why does it “feel like something” to be conscious? Even a perfect physical description of brain activity doesn’t explain qualia — the redness of red, the painfulness of pain. Chalmers argues this may permanently resist scientific explanation.

⚙️

Functionalism

Mental states are defined by their functional role, not their physical substrate. If silicon performs the same functional operations as a brain, it thereby has mental states. This view is most sympathetic to the possibility of AI consciousness.

Integrated Information Theory

Consciousness = integrated information Φ (phi). High Φ = rich inner experience. A simple logic gate: Φ ≈ 0. A human brain: high Φ. Simple feedforward neural networks have very low Φ; richly recurrent systems could have more.

Whether current AI systems are conscious is not a scientific question we can currently answer. It is prudent to neither confidently assert nor confidently deny machine sentience. The honest answer is: we don’t know — and we don’t yet have the tools to find out.

The Symbol Grounding Problem Core

Stevan Harnad (1990) identified a fundamental problem with purely symbolic AI: symbols in a dictionary are defined in terms of other symbols. “Cat: a small domesticated carnivorous mammal with soft fur…” — circular definitions all the way down. For humans, symbols are grounded in sensorimotor experience. You know “red” because you have seen red. You know “hot” because you have felt heat. For large language models, symbols are grounded only in other symbols — the statistical contexts in which words appear across trillions of tokens of text.

Grounded Symbols (Human)

“hot” = direct thermal sensation experienced via skin receptors
“red” = specific wavelength of light experienced visually
“rough” = tactile perception from physical touch
Meaning anchored in sensorimotor interaction with the world

Ungrounded Symbols (LLM)

“hot” = statistical co-occurrence with “fire”, “burn”, “temperature”
“red” = co-occurrence with “apple”, “stop sign”, “blood”
“rough” = co-occurrence with “sandpaper”, “texture”, “jagged”
Meaning is pattern of word co-occurrence in training corpus only

This is why multimodal models (CLIP, GPT-4V, Gemini) and embodied robotics are active research frontiers — they attempt to ground language in perceptual or physical experience. Whether statistical grounding in text is “sufficient” for understanding is precisely the empirical question the field is now testing at scale.

Strong vs Weak AI Introductory

🔧

Weak AI (Narrow AI)

Searle’s term: AI that simulates intelligence for specific tasks without genuine understanding. The system behaves as if it understands, but has no intentionality. All current AI is Weak AI — GPT-4, AlphaGo, image classifiers. The useful and productive engineering view: build systems that work, regardless of philosophical status.

🌟

Strong AI

A system that genuinely understands — not just simulates understanding. Would have beliefs, desires, and genuine intentionality in the philosophical sense. Searle argued this is impossible without biological substrate. Most AI researchers bypass this distinction entirely and focus on capabilities.

The practical view: for engineering purposes, the distinction doesn’t matter. Build systems that work. The question of whether they “truly understand” is a philosophical question that doesn’t affect whether your spam filter catches spam — but it does affect how we think about alignment, rights, and long-term AI governance.

Frames of Intelligence Reference

🧩

Multiple Intelligences (Gardner)

Eight distinct intelligences: linguistic, logical-mathematical, spatial, musical, bodily-kinaesthetic, interpersonal, intrapersonal, naturalistic. AI today is strongest in the first two; the others are covered unevenly or not at all.

🤸

Embodied Cognition

Intelligence is shaped by having a body that interacts with the world. Physical AI (robotics + LLMs) is the active frontier precisely because disembodied language models lack grounding in physical causality.

🏛️

Cognitive Architectures

ACT-R (Anderson) and SOAR (Laird, Newell) are computational models of human cognition with procedural memory, declarative memory, and attention modules — making testable predictions verified against human reaction time data.

📋 Chapter 1.4 — Key Takeaways

The Turing Test measures behavioural imitation, not intelligence — modern LLMs routinely pass it, rendering it obsolete as a benchmark
Chinese Room: syntax ≠ semantics — symbol manipulation without understanding may not be intelligence; the argument remains unresolved
The Hard Problem: why is there subjective experience at all? Science cannot currently answer this for biological or artificial systems
Symbol grounding: LLMs know words from statistical co-occurrence patterns, not sensorimotor experience — multimodal AI attempts to address this
Strong AI (genuine understanding) vs Weak AI (behavioural simulation) — all current AI is Weak AI by Searle’s definition
These philosophical questions directly motivate alignment research: if AI can have goals and understanding, its objectives must be aligned with human values

1.5

Chapter 1.5 · AIMA Framework

Problem Solving & Rational Agents

PEAS, environment types, and the agent taxonomy — the conceptual framework that connects classical AI to modern LLM agents. Russell & Norvig's Artificial Intelligence: A Modern Approach (AIMA) defines AI as the study of agents that perceive their environment and act to maximise their performance measure. This framework is the conceptual backbone of Domain 8 (Agentic AI) — understanding it now is essential.

The Rational Agent Framework In-depth

From Russell & Norvig's Artificial Intelligence: A Modern Approach — the dominant textbook in AI education — an agent is "anything that perceives its environment through sensors and acts upon that environment through actuators." The word "agent" is deliberately broad: it covers thermostats, chess programs, autonomous vehicles, and GPT-4 equally.

What makes an agent rational? For each possible percept sequence, a rational agent should select an action expected to maximise its performance measure, given the evidence provided by the percept sequence and whatever built-in knowledge the agent has. Rationality ≠ omniscience (knowing all outcomes). Rationality ≠ perfection (always choosing optimally). Rationality = expected utility maximisation given available information.

Why does this abstraction matter? Because it unifies everything — a thermostat perceives temperature and acts by switching heating, GPT-4 perceives your text and acts by generating tokens, a self-driving car perceives the road and acts by steering. Every AI system in this documentation can be analysed through this lens.

The Agent-Environment Loop — the foundational AI abstraction


PEAS Framework In-depth

PEAS stands for Performance measure, Environment, Actuators, Sensors. It is the standard tool for formally specifying any AI agent task. Before building any AI system, PEAS forces you to answer four questions: What counts as success? What will the agent interact with? How can it act? What can it perceive?

PEAS Framework — Specifying any AI agent task

Agent	Performance Measure	Environment	Actuators	Sensors
Self-driving car	Safe arrival, comfort, legal compliance, efficiency	Roads, traffic, pedestrians, weather, other vehicles	Steering wheel, brakes, accelerator, horn	Cameras, LiDAR, GPS, radar, odometer
Medical diagnosis AI	Correct diagnosis, patient safety, cost efficiency	Patient data, records, lab results, imaging	Treatment recommendation, flag urgent cases	EMR systems, lab APIs, imaging APIs, notes
ChatGPT / LLM Agent	Helpful, harmless, honest; task completion	Human conversation, web, code environment	Text output, API calls, code execution, file writes	Text input, tool outputs, user feedback
Chess-playing AI	Win games, minimise blunders	Chess board, opponent's moves	Move selection (display update)	Board state (current position)
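One lightweight way to make the four PEAS questions hard to skip is to write the specification down as a record before any agent code exists. The `PEAS` dataclass and its `missing` helper below are hypothetical names for this sketch; the chess values are copied from the table above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PEAS:
    performance: tuple[str, ...]
    environment: tuple[str, ...]
    actuators: tuple[str, ...]
    sensors: tuple[str, ...]

    def missing(self) -> list[str]:
        """Names of PEAS components that were left empty."""
        return [name for name, value in vars(self).items() if not value]


chess = PEAS(
    performance=("win games", "minimise blunders"),
    environment=("chess board", "opponent's moves"),
    actuators=("move selection",),
    sensors=("board state",),
)
draft = PEAS(performance=(), environment=("warehouse floor",), actuators=(), sensors=("camera",))

print(chess.missing())
print(draft.missing())
```

An empty `performance` entry is the most common gap in practice: it means nobody has yet said what counts as success.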

Environment Types Core

Not all AI environments are created equal. The type of environment an agent operates in determines which algorithms are suitable, how much memory the agent needs, and how complex its decision-making must be. AIMA identifies six key dimensions along which environments vary.

Property	Definition	Example of the first type	Example of the second type	Impact on Agent Design
Observable vs Partial	Can agent sense complete state?	Chess (full board visible)	Poker (opponent cards hidden)	Partial → needs memory and belief states
Deterministic vs Stochastic	Are outcomes predictable?	Chess (moves are certain)	Self-driving (pedestrians random)	Stochastic → needs probability reasoning
Episodic vs Sequential	Do past actions affect future?	Image classification (each independent)	Chess / Conversation (history matters)	Sequential → needs memory
Static vs Dynamic	Does environment change while agent thinks?	Crossword puzzle	Real-time trading	Dynamic → needs fast decisions
Discrete vs Continuous	Finite vs infinite states/actions?	Chess (countable positions)	Driving (continuous steering)	Continuous → needs function approximation
Single vs Multi-agent	One or many agents?	Sudoku solver	Multiplayer games, stock market	Multi-agent → must model other agents

⚠️

Most Challenging: Partial + Stochastic

Real-world AI tasks are almost always both. A self-driving car can't see around corners (partial) and pedestrians behave unpredictably (stochastic). This combination is why AI for physical environments is so hard.

🧠

Most Impactful: Sequential

If past matters for future — which it does in almost all useful tasks — the agent needs memory. This is why LLMs have context windows and why agents need long-term memory stores.

🤝

Emerging: Multi-agent

Modern AI increasingly involves multiple agents. LLM orchestration (LangGraph, AutoGen, CrewAI) is multi-agent by design. Critical for Domain 8: Agentic AI.

Why environment type matters for design: A fully observable, deterministic, static, discrete environment (like chess) can in principle be solved by exhaustive search, even though the state space is far too large to do so in practice. A partially observable, stochastic, sequential, continuous, multi-agent environment (like autonomous driving) requires probabilistic reasoning, memory, planning, and robustness to uncertainty, all simultaneously. The gap in engineering complexity is enormous.

Types of Agents In-depth

Russell & Norvig define five progressively sophisticated agent architectures. Each adds a new capability layer. Understanding this hierarchy maps cleanly to the spectrum from a smoke detector to a fully autonomous AI assistant.

Agent Sophistication Spectrum — From Reflex to Learning

🔦

① Simple Reflex Agent

Condition-action rules mapping current percepts directly to actions. No memory. No history. Fast and auditable but completely brittle — fails whenever current percept doesn't capture all relevant state.

Thermostat, smoke detector, basic spam filter
Limitation: useless in partially observable environments

🗺️

② Model-Based Reflex Agent

Maintains an internal state — a model of the unobservable parts of the world. Updates state based on percepts over time. Can handle partial observability that pure reflex agents cannot.

SLAM robot, Roomba tracking its path
Limitation: rules still don't reason about goals
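The difference between the first two agent types comes down to whether anything is remembered between percepts. Below is a minimal sketch using a thermostat; the setpoint, the 1-degree band, and the class and function names are assumptions chosen for illustration.

```python
def reflex_thermostat(temp_c, setpoint=20.0):
    """Simple reflex: the action depends only on the current percept."""
    return "heat_on" if temp_c < setpoint else "heat_off"


class ModelBasedThermostat:
    """Model-based reflex: keeps internal state between percepts."""

    def __init__(self, setpoint=20.0, band=1.0):
        self.setpoint = setpoint
        self.band = band
        self.heating = False  # internal state the current percept does not reveal

    def act(self, temp_c):
        if temp_c < self.setpoint - self.band:
            self.heating = True
        elif temp_c > self.setpoint + self.band:
            self.heating = False
        # Inside the band, keep doing whatever we were doing (hysteresis).
        return "heat_on" if self.heating else "heat_off"


readings = (18.0, 20.5, 21.5, 19.5)
agent = ModelBasedThermostat()
print([reflex_thermostat(t) for t in readings])
print([agent.act(t) for t in readings])
```

The reflex version flips every time the reading crosses the setpoint, while the model-based version gives different answers to the same reading depending on what happened before. Neither has any notion of a goal; both are still just rules.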

🎯

③ Goal-Based Agent

Has an explicit goal and searches for action sequences to achieve it. Uses search algorithms (A*, BFS) to plan multiple steps ahead. More flexible — multiple paths to the same goal.

GPS navigation, STRIPS planner, theorem provers
Limitation: all goals binary — doesn't handle tradeoffs

📊

④ Utility-Based Agent

Uses a utility function — graded preferences between states, not just goal/no-goal. Maximises expected utility (probability × value). Handles stochastic outcomes naturally.

Recommendation systems, portfolio managers, RL agents
Foundation of decision theory and modern RLHF for LLMs
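Expected utility is just a probability-weighted sum, which makes the decision rule easy to state in code. The route names, probabilities, and utility values below are invented for illustration.

```python
def expected_utility(outcomes):
    """outcomes: list of (probability, utility) pairs for one action."""
    return sum(p * u for p, u in outcomes)


def best_action(actions):
    """Pick the action whose outcomes have the highest expected utility."""
    return max(actions, key=lambda name: expected_utility(actions[name]))


ACTIONS = {
    "highway": [(0.8, 10), (0.2, -20)],  # usually fast, sometimes jammed
    "back_roads": [(1.0, 3)],            # slower but predictable
}

for name, outcomes in ACTIONS.items():
    print(name, expected_utility(outcomes))
print(best_action(ACTIONS))
```

A goal-based agent could only say that both routes reach the destination. The utility function is what lets the agent trade a small chance of a bad outcome against a better typical one.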

🎓

⑤ Learning Agent

Any agent type augmented with a learning component that modifies behaviour based on experience. Four internal components: performance element (selects actions), critic (evaluates against a standard), learning element (improves performance), problem generator (suggests exploratory actions). All modern AI systems are learning agents — GPT learned from text, AlphaGo Zero learned from self-play.

State-Space Problem Formulation Core

Before an agent can search for a solution, it must formally define the problem. AIMA's five-component problem formulation is the standard method. It forces precision: what exactly is a "state"? What exactly is an "action"? What exactly is the "goal"?

Define Initial State: starting configuration

Define Actions: available transitions

Transition Model: state after action

Goal Test: is this a goal state?

Path Cost: cost to reach state

Search Algorithm: find solution

State Space Search — find the path from start to goal

Element	Definition	Example: 8-Puzzle	Example: GPS Navigation
Initial state	Starting configuration	Random tile arrangement	Current location
Actions	Set of possible moves from each state	Slide tile left, right, up, down	Turn left/right, go straight
Transition model	Result of taking each action in each state	New tile arrangement after sliding	New position after movement
Goal test	Determines if current state is goal	Tiles in order 1–8, blank at bottom-right	Current location = destination
Path cost	Numeric cost of a path	Number of moves taken	Distance or estimated travel time
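The five components in the table can be written down almost verbatim for the 8-puzzle. This is a sketch under stated assumptions: states are 9-tuples read row by row with 0 as the blank, moves are modelled as sliding the blank (equivalent to sliding a tile the other way), and the function names are chosen for this example.

```python
from collections import deque

GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)  # 0 marks the blank, bottom-right

# Moves of the blank as (row delta, column delta).
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def actions(state):
    """Actions: legal blank moves from this state."""
    row, col = divmod(state.index(0), 3)
    return [name for name, (dr, dc) in MOVES.items()
            if 0 <= row + dr < 3 and 0 <= col + dc < 3]


def result(state, action):
    """Transition model: the state after moving the blank."""
    dr, dc = MOVES[action]
    blank = state.index(0)
    row, col = divmod(blank, 3)
    target = (row + dr) * 3 + (col + dc)
    tiles = list(state)
    tiles[blank], tiles[target] = tiles[target], tiles[blank]
    return tuple(tiles)


def is_goal(state):
    """Goal test."""
    return state == GOAL


def solve(initial):
    """BFS from the initial state; path cost is the number of moves."""
    frontier = deque([(initial, [])])
    seen = {initial}
    while frontier:
        state, path = frontier.popleft()
        if is_goal(state):
            return path
        for action in actions(state):
            nxt = result(state, action)
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, path + [action]))
    return None  # this start state cannot reach the goal


print(solve((1, 2, 3, 4, 5, 6, 0, 7, 8)))
```

Plain BFS is workable here only because the state space is small. The same formulation carries over unchanged to larger puzzles, but the search strategy has to change.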

Why search-based AI was replaced: The 8-puzzle has ~181,000 reachable states. The 15-puzzle has ~10¹³. Chess: ~10⁴⁶. Go: ~10¹⁷⁰. Real-world AI problems are combinatorially intractable for exhaustive search, which is precisely why machine learning (pattern recognition from data) displaced search as the dominant paradigm for perception, language, and unstructured reasoning tasks.

📋 Chapter 1.5 — Key Takeaways

Agent = anything that perceives and acts. PEAS (Performance, Environment, Actuators, Sensors) formally specifies any agent task
Environments are: observable/partial, deterministic/stochastic, episodic/sequential, static/dynamic, discrete/continuous, single/multi-agent
Partially observable + stochastic = most real-world tasks. This is why AI for physical environments is so hard.
Five agent types: Reflex → Model-based → Goal-based → Utility-based → Learning — each adds a capability layer
State-space search: define initial state, actions, transitions, goal test, path cost — then search. Combinatorial explosion limits this approach at scale.
This framework directly maps to modern LLM agents (Domain 8): percepts = tool outputs + messages, actions = tool calls + text generation


1.6

Chapter 1.6 · Schools of Thought

Key Paradigms & Schools of Thought

AI is not one field — it is a collection of competing intellectual traditions with different assumptions about what intelligence is and how to build it. Understanding the camps explains why researchers from different traditions argue past each other, and why hybrid approaches are gaining traction.

Connectionism In-depth

Connectionism is the paradigm that won. Inspired by the structure of the biological brain, it proposes that intelligence emerges from networks of simple connected units — and that knowledge is stored not in explicit rules, but in the strengths of connections between units. A single neuron knows nothing; billions of them, connected with learned weights, produce language, vision, and reasoning.

The lineage runs from McCulloch & Pitts (1943), the first mathematical neuron, through Rosenblatt's Perceptron (1957), through backpropagation as popularised by Rumelhart, Hinton & Williams (1986), which made multi-layer training practical, through LeCun's LeNet (1998) and AlexNet (2012), all the way to the Transformer (2017) and the LLM era. Each step applied the same core idea at larger scale, with more data and compute.

⚡

Key Properties

Parallel distributed processing — thousands of units compute simultaneously
Graceful degradation — partial damage reduces performance gradually, not catastrophically
Learning from examples — weights adapt through exposure, not programming
Statistical regularities — finds patterns in data regardless of whether humans can articulate them
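"Weights adapt through exposure, not programming" can be seen in a few lines with a single perceptron. The sketch below uses the classic error-driven update rule on the AND function; the learning rate, epoch count, and function names are assumptions for illustration, and nothing about AND is hard-coded into the unit.

```python
def train_perceptron(samples, epochs=20, lr=1.0):
    """Error-driven update: nudge weights only when a prediction is wrong."""
    weights = [0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for (x1, x2), target in samples:
            predicted = 1 if weights[0] * x1 + weights[1] * x2 + bias > 0 else 0
            error = target - predicted  # -1, 0, or +1
            weights[0] += lr * error * x1
            weights[1] += lr * error * x2
            bias += lr * error
    return weights, bias


def predict(weights, bias, x1, x2):
    return 1 if weights[0] * x1 + weights[1] * x2 + bias > 0 else 0


AND_GATE = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(AND_GATE)

for (x1, x2), target in AND_GATE:
    print((x1, x2), predict(weights, bias, x1, x2), target)
```

The same code trained on XOR never settles, because no single line separates the classes. That limitation is what multi-layer networks and backpropagation later addressed.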

📅

The Connectionist Timeline

1943 — McCulloch-Pitts neuron: first mathematical model
1957 — Rosenblatt Perceptron: first learning machine
1986 — Rumelhart & Hinton: backpropagation for multi-layer nets
1998 — LeCun's LeNet: CNNs on handwritten digits
2012 — AlexNet: GPU-trained deep CNN wins ImageNet
2017 — Transformer: attention replaces recurrence entirely
2022+ — LLMs: trillion-parameter connectionist systems

Connectionist Network — Intelligence in weights, not rules

The Great Debate In-depth

For three decades the battle between symbolicism and connectionism was not a polite academic disagreement: it was personal, bitter, and high-stakes. Minsky and Papert's critique in Perceptrons (1969) helped choke off neural network funding for more than a decade. Connectionists returned fire in 1986 with backpropagation. The symbolicism camp called neural networks "black box statistics". The connectionist camp called expert systems "fragile toy programs". Funding, careers, and the direction of the entire field were at stake.

Symbolicism (GOFAI)

Intelligence = symbol manipulation according to rules
Knowledge = explicit, human-readable rules and facts
Interpretable by design: you can read the logic
Works with zero data: rules are programmed in
Generalises by logical deduction from axioms
Can solve novel problems if rules cover them
Brittle: fails hard outside the defined domain

Connectionism (Neural Networks)

Intelligence = statistical pattern matching in weights
Knowledge = distributed across millions of parameters
Black box: learned weights are not human-readable
Requires large datasets to train (labelled, or raw data for self-supervised models)
Generalises by interpolation within training distribution
Struggles with truly novel, out-of-distribution problems
Robust: degrades gracefully under noise and variation

🏛️

Symbolicism's Best Argument

"Neural networks can't do compositional reasoning. They can't understand that 'the dog bit the man' is fundamentally different from 'the man bit the dog' — they just see statistical co-occurrences. Language has recursive structure that requires rules, not statistics. Neural networks that can't count reliably can't be the basis of real intelligence."

🧠

Connectionism's Best Argument

"Where are the rules for understanding a face? For recognising speech across different accents? For generating coherent prose? The rules are impossibly complex to write — the only viable path is to learn from data. The brain itself is a neural network, not a rule engine. Evolution didn't write logic programs — it tuned connection weights."

The debate was largely resolved by the empirical success of deep learning in the 2010s. But symbolicism's core insight — that structured, compositional reasoning matters — has returned in the form of neuro-symbolic AI, chain-of-thought prompting, and structured reasoning models like o1 and DeepSeek-R1.

Dimension	Symbolicism	Connectionism	Current Status (2026)
Core claim	Intelligence = symbol manipulation by rules	Intelligence emerges from connected simple units	Connectionism dominant; hybrid gaining
Key figures	McCarthy, Minsky, Newell, Simon	Rosenblatt, Rumelhart, Hinton, LeCun, Bengio	Hinton, LeCun, Bengio — 2018 Turing Award
Strengths	Interpretable, logically consistent, structured	Learns from data, robust, generalises well	Both needed; neither sufficient alone
Weaknesses	Brittle, doesn't scale, knowledge bottleneck	Black box, data-hungry, fails OOD	Interpretability & robustness unsolved
Modern form	Knowledge graphs, formal verification, SMT solvers	LLMs, diffusion models, transformers	Neuro-symbolic = active frontier

Probabilistic & Bayesian AI Core

The third major paradigm takes a different starting point: the world is fundamentally uncertain, and any intelligent system must represent and reason about that uncertainty explicitly. Where symbolicism asks "what is true?", probabilistic AI asks "how confident am I, and how should new evidence update that confidence?" This is Bayes' theorem as the engine of intelligence: P(hypothesis | evidence) ∝ P(evidence | hypothesis) × P(hypothesis).

🎲

Bayesian AI

Models uncertainty explicitly as probability distributions over hypotheses. Bayes' theorem: P(H|E) ∝ P(E|H) × P(H). New evidence updates your prior belief to a posterior. Principled — but computing exact posteriors is often intractable, requiring approximations: MCMC, variational inference, Laplace approximation.

Applications: spam filtering, medical diagnosis, sensor fusion, A/B testing, causal inference
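A minimal sketch of the update rule above, applied to a diagnostic test. The prevalence, sensitivity, and false-positive rate are illustrative assumptions, not figures from this chapter; the function name `posterior` is likewise hypothetical.

```python
def posterior(prior: float, p_e_given_h: float, p_e_given_not_h: float) -> float:
    """Return P(H | E) for a binary hypothesis using Bayes' theorem."""
    # P(E) = P(E|H) P(H) + P(E|not H) P(not H)
    evidence = p_e_given_h * prior + p_e_given_not_h * (1.0 - prior)
    return p_e_given_h * prior / evidence

# Assumed numbers: 1% prevalence, 95% sensitivity, 5% false-positive rate.
p1 = posterior(prior=0.01, p_e_given_h=0.95, p_e_given_not_h=0.05)
# A second, independent positive test reuses the first posterior as its prior.
p2 = posterior(prior=p1, p_e_given_h=0.95, p_e_given_not_h=0.05)

print(f"after one positive test:  {p1:.3f}")   # about 0.161
print(f"after two positive tests: {p2:.3f}")   # about 0.785
```

Even with a fairly accurate test, one positive result leaves the hypothesis at roughly 16% because the prior is so low. That is the kind of result explicit probabilistic reasoning is meant to surface.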

🔗

Probabilistic Graphical Models

Bayesian networks: directed acyclic graphs encoding conditional dependencies. Exact inference via variable elimination is efficient on sparse, tree-like graphs but NP-hard in general. Hidden Markov Models (HMMs): hidden states with observable outputs; they dominated speech recognition from the 1980s to the early 2010s. Gaussian Processes: non-parametric Bayesian learning with uncertainty estimates.

Still used: robotics (SLAM), bioinformatics, probabilistic programming (Stan, PyMC, Pyro)

Bayesian Network — explicit probabilistic dependencies
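To make "explicit probabilistic dependencies" concrete, here is a small sketch of inference by enumeration in a three-node network. The network (Rain → WetGrass ← Sprinkler) and every probability in it are hypothetical, chosen only for illustration. Enumeration is the simplest exact method and scales poorly; it stands in here for variable elimination.

```python
from itertools import product
from typing import Optional

# Assumed network: Rain -> WetGrass <- Sprinkler, with Rain and Sprinkler independent.
P_RAIN = 0.2
P_SPRINKLER = 0.1
P_WET = {  # P(WetGrass=True | Rain, Sprinkler)
    (True, True): 0.99,
    (True, False): 0.90,
    (False, True): 0.80,
    (False, False): 0.00,
}

def joint(rain: bool, sprinkler: bool, wet: bool) -> float:
    """Joint probability, factorised along the edges of the graph."""
    p = P_RAIN if rain else 1 - P_RAIN
    p *= P_SPRINKLER if sprinkler else 1 - P_SPRINKLER
    p_wet = P_WET[(rain, sprinkler)]
    return p * (p_wet if wet else 1 - p_wet)

def p_rain_given(wet: bool, sprinkler: Optional[bool] = None) -> float:
    """P(Rain=True | evidence), summing the joint over unobserved variables."""
    sprinkler_values = [True, False] if sprinkler is None else [sprinkler]
    numerator = sum(joint(True, s, wet) for s in sprinkler_values)
    denominator = sum(joint(r, s, wet)
                      for r, s in product([True, False], sprinkler_values))
    return numerator / denominator

print(f"P(rain | wet grass)               = {p_rain_given(True):.3f}")
print(f"P(rain | wet grass, sprinkler on) = {p_rain_given(True, True):.3f}")
```

Observing that the sprinkler was on lowers the probability of rain from about 0.74 to about 0.24. This "explaining away" effect falls out of the graph structure without any special-case rule.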

Probabilistic AI never dominated the way symbolicism or connectionism did, but it remains indispensable in specific contexts. Robotics relies heavily on Bayesian state estimation (Kalman filters, particle filters). A/B testing and causal inference are exercises in statistical inference under uncertainty, whether carried out in a frequentist or a Bayesian style. Modern LLM calibration, meaning how well a model's confidence scores match its actual accuracy, is a probabilistic question. RLHF reward models are fitted with a probabilistic preference model (typically Bradley-Terry) over human comparisons. The probabilistic paradigm's contribution is the rigorous treatment of uncertainty that pure neural approaches often lack.
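As an illustration of Bayesian state estimation, the sketch below runs a one-dimensional Kalman filter over a few noisy readings of a constant quantity. The measurements and noise variances are made-up values, and the static-state model is a simplifying assumption; real robotics filters track multi-dimensional, moving states.

```python
def kalman_step(mean, var, measurement, meas_var, process_var):
    """One predict-update cycle for a scalar state with Gaussian belief."""
    # Predict: the state is assumed static, so only the uncertainty grows.
    var = var + process_var
    # Update: the Kalman gain weighs the measurement against the prediction.
    gain = var / (var + meas_var)
    mean = mean + gain * (measurement - mean)
    var = (1.0 - gain) * var
    return mean, var

# Start from a vague prior and fold in hypothetical sensor readings near 5.0.
mean, var = 0.0, 1000.0
for z in [5.1, 4.9, 5.2, 5.0, 4.8]:
    mean, var = kalman_step(mean, var, z, meas_var=0.25, process_var=0.01)

print(f"estimate = {mean:.2f}, variance = {var:.3f}")
```

Each update is Bayes' theorem for Gaussians: the posterior mean sits between the prediction and the measurement, and the posterior variance ends up smaller than that of any single reading.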

Evolutionary Computation Reference

🧬

How It Works

Maintain a population of candidate solutions
Evaluate each against a fitness function
Select the fittest for reproduction
Apply crossover (recombination) and mutation
Repeat — no gradients required
Powerful where gradient descent fails: discontinuous objectives, black-box functions, combinatorial spaces
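The loop above can be sketched as a small genetic algorithm. The objective (OneMax, counting 1-bits), tournament selection, elitism, and all parameter values are assumptions chosen to keep the example short; they are one common configuration, not the only one.

```python
import random

def one_max(bits):
    """Fitness: number of 1s in the chromosome (a standard toy objective)."""
    return sum(bits)

def tournament(population, rng, size=3):
    """Selection: return the fittest of `size` randomly drawn individuals."""
    return max(rng.sample(population, size), key=one_max)

def evolve(n_bits=20, pop_size=30, generations=60, mutation_rate=0.02, seed=0):
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_bits)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        next_gen = [max(population, key=one_max)]  # elitism: keep the current best
        while len(next_gen) < pop_size:
            parent_a = tournament(population, rng)
            parent_b = tournament(population, rng)
            cut = rng.randrange(1, n_bits)          # single-point crossover
            child = parent_a[:cut] + parent_b[cut:]
            # Mutation: flip each bit independently with a small probability.
            child = [b ^ 1 if rng.random() < mutation_rate else b for b in child]
            next_gen.append(child)
        population = next_gen
    return max(population, key=one_max)

best = evolve()
print(one_max(best), "of", len(best), "bits set")
```

Nothing here uses a gradient: the only feedback is the fitness score, which is why the same loop applies to discontinuous or black-box objectives.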

🚀

Modern AI Applications

Genetic Algorithms: solutions as bit-string chromosomes, selection pressure
Evolution Strategies (ES): optimise continuous parameters; OpenAI (Salimans et al., 2017) showed ES can rival gradient-based RL on standard benchmarks
NEAT: neuroevolution that evolves both topology and weights of neural networks
Neural Architecture Search (NAS): automated architecture discovery; evolutionary search produced AmoebaNet, while MobileNetV3 and EfficientNet came from reinforcement-learning-based search
Hyperparameter optimisation: black-box search over learning rates, depths, widths

Genetic Algorithms · NEAT · Neural Architecture Search · Evolution Strategies · Neuroevolution

Embodied & Situated AI Reference

🤖

Rodney Brooks & the Core Thesis

MIT, late 1980s: "Intelligence without Representation" (1991)
Explicit world models are unnecessary — and may be counterproductive
Subsumption architecture: layered behaviours, each suppressing lower ones
Behaviours: avoid obstacles → wander → follow walls → seek light
Result: surprisingly robust behaviour with no search, no world model
Contrast: GOFAI built maps of the world; Brooks robots just acted in it
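A rough sketch of behaviour-based control in the spirit of the list above. This is a simplification: real subsumption wires suppression between the signals of individual layers, whereas the code uses a fixed priority list and, as an assumption, checks obstacle avoidance first so that the reflex is never bypassed. The sensor keys and action names are hypothetical.

```python
from typing import Callable, List, Optional

Behaviour = Callable[[dict], Optional[str]]

def avoid_obstacle(sensors: dict) -> Optional[str]:
    return "turn_away" if sensors["obstacle_distance"] < 0.3 else None

def seek_light(sensors: dict) -> Optional[str]:
    return "steer_to_light" if sensors["light_visible"] else None

def follow_wall(sensors: dict) -> Optional[str]:
    return "track_wall" if sensors["wall_nearby"] else None

def wander(sensors: dict) -> Optional[str]:
    return "random_walk"  # default behaviour, always applicable

# Ordered by priority: the first behaviour that fires suppresses the rest.
LAYERS: List[Behaviour] = [avoid_obstacle, seek_light, follow_wall, wander]

def act(sensors: dict) -> str:
    """Pick an action directly from current sensor readings; no map, no planning."""
    for behaviour in LAYERS:
        action = behaviour(sensors)
        if action is not None:
            return action
    raise RuntimeError("no behaviour fired")

print(act({"obstacle_distance": 0.1, "light_visible": True, "wall_nearby": True}))
print(act({"obstacle_distance": 2.0, "light_visible": False, "wall_nearby": False}))
```

The controller keeps no state between calls, which mirrors the claim that robust behaviour can arise without an internal world model.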

🌍

Why It Matters for Modern AI

Intelligence emerges from physical interaction with the world, not just computation
RT-2 (Google DeepMind): vision-language-action model that transfers web-scale vision-language knowledge to robot actions
π₀ (Physical Intelligence): foundation model for physical manipulation
Symbol grounding (Ch 1.4) may require embodiment to solve
An AI that has felt heat and grasped objects has symbols grounded in experience
Pure text-trained LLMs lack this physical grounding entirely

Neuro-Symbolic AI Core

The frontier of AI research in 2026 is increasingly defined by the attempt to combine the strengths of both paradigms: neural perception and fluency with symbolic precision and compositionality. Neither pure approach is sufficient — neural networks hallucinate and fail at systematic reasoning; symbolic systems can't perceive raw inputs or handle uncertainty. The hybrid is not a compromise — it is a new paradigm that can do things neither can do alone.

⚠️

Why Pure Neural Struggles

Hallucination — generates confident falsehoods with no symbolic grounding
Arithmetic errors — counting and algebra are unreliable in raw LLMs
Logical inconsistency — contradicts itself across long contexts
Rule-following failures — can't reliably follow strict constraints
OOD brittleness — fails on distributions far from training data

🔬

Why Neuro-Symbolic Is the Frontier

Tool-using LLMs: symbolic precision (calculator, code) + neural fluency
Chain-of-thought: structured reasoning steps in natural language
AlphaGo / AlphaZero: neural value/policy network + MCTS symbolic search
AlphaGeometry (2024): neural language model proposing auxiliary constructions + symbolic deduction engine; solved 25 of 30 olympiad geometry problems
Knowledge Graph + LLM: structured facts + generative reasoning (RAG)

The Neuro-Symbolic Spectrum — most frontier AI sits in the middle

The five most important neuro-symbolic integrations in production AI today:

① Tool use: LLMs call Python interpreters, calculators, and search APIs for symbolic precision. Neural fluency + symbolic correctness.
② Chain-of-thought prompting: eliciting explicit intermediate reasoning steps mimics symbolic deduction chains. Each step can be inspected, although the written steps are not guaranteed to reflect the model's actual computation.
③ AlphaGo / AlphaZero: the neural policy and value networks supply learned intuition; MCTS provides principled symbolic tree search. Neither alone reaches superhuman level.
④ RAG (Retrieval-Augmented Generation): a neural LLM queries an external store, either a knowledge graph (symbolic) or a vector index (learned embeddings), and conditions its answer on what it retrieves. This grounds generation in checkable sources.
⑤ Reasoning models (o1, DeepSeek-R1): extended internal "thinking" over intermediate steps before producing output. These models are still purely neural, but they are the closest frontier LLMs come to explicit step-by-step deduction.
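The tool-use pattern in item ① can be sketched as a symbolic calculator that a model's output is routed to. The `CALL calculator:` convention and both function names are hypothetical, invented for this example; production systems use structured function-calling interfaces instead.

```python
import ast
import operator

# Arithmetic operators the tool is willing to evaluate.
_OPS = {
    ast.Add: operator.add, ast.Sub: operator.sub, ast.Mult: operator.mul,
    ast.Div: operator.truediv, ast.Pow: operator.pow,
}

def calculator(expression: str):
    """Evaluate a plain arithmetic expression exactly, rejecting anything else."""
    def evaluate(node):
        if (isinstance(node, ast.Constant)
                and isinstance(node.value, (int, float))
                and not isinstance(node.value, bool)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](evaluate(node.left), evaluate(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -evaluate(node.operand)
        raise ValueError("unsupported expression")
    return evaluate(ast.parse(expression, mode="eval").body)

def run_tool_call(model_output: str) -> str:
    """Route output of the assumed form 'CALL calculator: <expr>' to the tool."""
    prefix = "CALL calculator:"
    if model_output.startswith(prefix):
        return str(calculator(model_output[len(prefix):].strip()))
    return model_output  # ordinary text passes through unchanged

print(run_tool_call("CALL calculator: 1234 * 5678"))  # 7006652
```

The division of labour is the point: the model decides when arithmetic is needed and phrases the expression, while the deterministic evaluator guarantees the result.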

📋 Chapter 1.6 — Key Takeaways

Connectionism: knowledge stored in weights, learned from data — robust, scalable, but opaque. The dominant paradigm since 2012.
Symbolicism: knowledge in explicit rules — interpretable, compositional, but brittle and unable to scale to real-world complexity
Probabilistic AI: uncertainty as a first-class citizen — Bayesian networks, HMMs, still essential in robotics and calibration
Evolutionary computation: selection + mutation — used in NAS, hyperparameter optimisation, and black-box RL
Embodied AI: intelligence emerges from physical interaction — motivates robotics frontier and addresses symbol grounding
Modern frontier is neuro-symbolic: neural perception + symbolic reasoning — chain-of-thought, tool use, AlphaGo, AlphaGeometry


1.7

Chapter 1.7 · Current Landscape

The AI Landscape Today & Tomorrow

This chapter connects Domain 1's history and theory to the present, and gives you a map of where each subsequent domain fits. AI in 2026 is defined by foundation models as the new default paradigm — but with hard limitations that motivate everything that follows in this curriculum.

The Current AI Landscape Core

The most important structural shift in AI since 2020 is the emergence of Foundation Models — large models pre-trained on broad, internet-scale data that can be adapted to almost any downstream task via prompting, fine-tuning, or RLHF. The old paradigm: train a separate model for each task (one for translation, one for summarisation, one for classification). The new paradigm: one model does all of them, often better than the specialised predecessors.

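This shift was driven by neural scaling laws (Kaplan et al., 2020): test loss falls predictably as a power-law function of compute, parameters, and data, and downstream task performance tends to improve with it. A power law already implies diminishing returns per unit of scale (each doubling buys a smaller absolute gain), but across many orders of magnitude the curve showed no plateau, and only recently have signs of one been debated. The implication: more scale reliably buys more capability, making the economics of frontier model training self-reinforcing.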
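A small numerical sketch of what a power-law scaling curve implies. The functional form L(N) = (N_c / N)^α follows the parameter-scaling law in Kaplan et al.; the constants below are close to the values reported there but should be treated as illustrative assumptions, and the "measurements" are synthetic points generated from the same formula.

```python
import math

def power_law_loss(n_params: float, n_c: float, alpha: float) -> float:
    """Loss as a power law in model size: L(N) = (N_c / N) ** alpha."""
    return (n_c / n_params) ** alpha

ALPHA, N_C = 0.076, 8.8e13  # assumed constants, for illustration only

# Synthetic "measurements" from 1M to 100B parameters.
sizes = [10.0 ** k for k in range(6, 12)]
losses = [power_law_loss(n, N_C, ALPHA) for n in sizes]

# A power law is a straight line in log-log space, so the exponent can be
# recovered with ordinary least squares on (log N, log L).
xs = [math.log(n) for n in sizes]
ys = [math.log(loss) for loss in losses]
mean_x = sum(xs) / len(xs)
mean_y = sum(ys) / len(ys)
slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
         / sum((x - mean_x) ** 2 for x in xs))
fitted_alpha = -slope

# Fractional loss reduction bought by each doubling of model size.
loss_cut_per_doubling = 1 - 2 ** (-fitted_alpha)
print(f"alpha = {fitted_alpha:.3f}, loss cut per doubling = {loss_cut_per_doubling:.1%}")
```

With an exponent this small, each doubling of parameters trims loss by only about 5%, which is why frontier progress has required order-of-magnitude jumps in scale.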

The Foundation Model Paradigm — one model, many capabilities

📐

Scale

Billions to trillions of parameters. Trained on internet-scale data over months on thousands of GPUs. GPT-4 is estimated to have cost $100M+ to train; next-generation frontier models are estimated at $1B+. Training compute for frontier models has been doubling roughly every 6–12 months.

✨

Emergence

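Capabilities that weren't explicitly trained emerge at scale: arithmetic, code generation, multi-step reasoning, in-context learning. Below a capability threshold: near-chance performance. Above it: sudden, surprising ability. Some of this sharpness may come from all-or-nothing evaluation metrics rather than from the models themselves (Schaeffer et al., 2023), but emergent capabilities remain scale's most striking phenomenon.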

🔧

Adaptability

One model adapts to hundreds of tasks via prompting alone. No task-specific training required for most applications. Fine-tuning (LoRA, QLoRA) and RLHF allow further specialisation when needed. The pre-train → adapt paradigm replaced train-from-scratch.
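To see why LoRA makes specialisation cheap, consider the arithmetic. LoRA (Hu et al., 2021) freezes a weight matrix W of shape d_out × d_in and trains only a low-rank update BA, with B of shape d_out × r and A of shape r × d_in. The layer size and rank below are assumed values for illustration.

```python
def full_finetune_params(d_out: int, d_in: int) -> int:
    """Trainable parameters when the whole weight matrix is updated."""
    return d_out * d_in

def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Trainable parameters for the low-rank factors B (d_out x r) and A (r x d_in)."""
    return rank * (d_out + d_in)

d = 4096   # assumed hidden size of one projection matrix
r = 8      # assumed LoRA rank

full = full_finetune_params(d, d)
lora = lora_params(d, d, r)
print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.2%}")
```

For this layer the adapter trains 65,536 parameters instead of 16,777,216, about 0.39% of the original. QLoRA applies the same idea on top of a quantised frozen base model.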

AI Capability Map In-depth

AI capability is not uniform across tasks. Understanding where AI already surpasses humans, where it approaches parity, and where it still falls short is essential for calibrating realistic expectations — and for knowing which research problems remain open.

AI Capability Map — where AI excels, approaches, and falls short

Capability	Benchmark	Human Score	Best AI (2026)	Status
Image Classification	ImageNet Top-5	~95%	99%+	✅ Surpassed
Speech Recognition	LibriSpeech WER	~5% WER	2–4% WER	✅ Surpassed
Reading Comprehension	SQuAD 2.0	89.5 F1	93+ F1	✅ Surpassed
Professional Knowledge	MMLU	89%	87–90%	≈ At parity
Code Generation	HumanEval	~85% (experts)	90%+	≈ At parity
Mathematical Reasoning	MATH (competition)	~60% (AMC)	70–85%	≈ Approaching
Common-Sense (physical)	PIQA	~95%	85–90%	⚠ Still behind
Long-horizon Planning	Novel open tasks	High	Low–medium	⚠ Still behind
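The speech recognition row reports word error rate (WER): substitutions, deletions, and insertions divided by the number of reference words. A minimal sketch using word-level edit distance follows; the example sentences are made up.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # dist[i][j] = minimum edits to turn ref[:i] into hyp[:j]
    dist = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dist[i][0] = i                      # delete all i reference words
    for j in range(len(hyp) + 1):
        dist[0][j] = j                      # insert all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,         # deletion
                dist[i][j - 1] + 1,         # insertion
                dist[i - 1][j - 1] + cost,  # match or substitution
            )
    return dist[-1][-1] / len(ref)

wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"WER = {wer:.1%}")  # one substitution + one deletion over six words
```

Lower is better here, unlike the accuracy-style metrics in the other rows, so "surpassed" for speech means the AI figure is below the human one.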

Key Players & Ecosystem Core

🔴

OpenAI

GPT-4o, o1, o3 series. Dominant commercial position. ChatGPT: 200M+ weekly active users (reported 2024). $13B Microsoft partnership and deep Azure integration. Reasoning models (o1/o3) made extended chain-of-thought a first-class capability. Controversial governance restructuring in 2024.

🔵

Anthropic

Claude 3.5 Sonnet, Claude 3 Opus. Safety-focused, Constitutional AI alignment. $4B Amazon investment + AWS deployment. Long context (200K tokens). Claude 3.7 introduced extended thinking. Preferred by enterprises with compliance requirements.

🟢

Google DeepMind

Gemini 1.5 Pro, Gemini 1.0 Ultra. Deep integration with Google Search and Workspace. TPU hardware advantage. AlphaFold 2 largely solved single-chain protein structure prediction. AlphaCode; robotics models such as RT-2. Gemini 2.0 Flash: fast multimodal at scale.

🟡

Meta AI

LLaMA 3 (open weights: 8B, 70B, 405B). Democratising AI development. The LLaMA ecosystem powers most open-source fine-tuning, quantisation, and deployment. Fuelled the open-source explosion that now competes with closed models.

🟣

Mistral AI

European AI champion. Mistral Large, Codestral, Mixtral (mixture-of-experts). Efficient models with open weights. Positioned as privacy-first alternative. GDPR-aligned for European enterprise deployment.

🌐

Open Source Community

Hugging Face ecosystem (models, datasets, Spaces). GGUF/llama.cpp local inference. LoRA/QLoRA fine-tuning. DeepSeek-R1 (China, open weights). Qwen (Alibaba). Phi-3 (Microsoft). Democratised deployment without API dependency.

Dimension	Closed Source	Open Source
Examples	GPT-4o, Claude 3.5, Gemini 1.5	LLaMA 3, Mistral, DeepSeek-R1, Qwen
Performance	Highest capability at frontier	Increasingly competitive (within 10–20%)
Cost	Per-token pricing ($0.001–$0.10/1K tokens)	Infrastructure cost only; low marginal cost per token at high utilisation
Privacy	Data sent to vendor servers	On-premise possible — full data control
Customisation	Limited (fine-tune API, system prompts)	Full weight access — fine-tune anything
Safety	Vendor-managed RLHF alignment	Community-managed — variable quality
Transparency	Minimal (model cards, no architecture details)	Weights + often training details available

Benchmarks & Measuring Progress In-depth

Benchmarks are the field's primary mechanism for measuring progress — and its primary mechanism for self-deception. The benchmark lifecycle is predictable: a new benchmark is created to test a genuine capability gap → models improve rapidly → benchmark saturates → researchers create a harder benchmark. The cycle repeats every 12–24 months.

Goodhart's Law in AI: "When a measure becomes a target, it ceases to be a good measure." AI benchmarks are routinely gamed — through training data contamination, fine-tuning on test distributions, or optimising prompts for specific test formats. Benchmark saturation does not mean a capability is solved. ARC-AGI was designed by François Chollet specifically to resist shortcut-learning — it remains the hardest general reasoning evaluation as of 2026.

Benchmark	Domain	What It Tests	Human Baseline	Status
ImageNet	Vision	1000-class image classification	~95%	Saturated — AI >99%
GLUE / SuperGLUE	NLP	Language understanding tasks	87% / 89%	Saturated — replaced
MMLU	Knowledge	57 academic subjects (multiple choice)	89%	At parity — gameable
HumanEval	Coding	Python function generation from docstrings	85% (experts)	Near parity (90%+)
MATH	Mathematics	Competition math (AMC/AIME level)	60% (AMC students)	AI ~70–85%
ARC-AGI	Reasoning	Novel visual pattern abstraction	85%	AI ~40% — hardest eval
BIG-Bench Hard	Diverse	23 reasoning-heavy tasks	~65%	AI 75%+ approaching
GPQA (Diamond)	Science	PhD-level biology, chemistry, physics	65–80%	AI 60–70% approaching
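HumanEval scores like those in the table are usually reported as pass@k: the probability that at least one of k sampled solutions passes the unit tests. The sketch below implements the unbiased estimator introduced with the benchmark (Chen et al., 2021); the sample counts in the usage lines are made up.

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate from n samples of which c passed the tests.

    Computes 1 - C(n - c, k) / C(n, k): one minus the probability that a
    random size-k subset of the n samples contains no passing solution.
    """
    if n - c < k:
        return 1.0  # every size-k subset must include a passing sample
    return 1.0 - comb(n - c, k) / comb(n, k)

# Hypothetical problem: 10 samples generated, 3 of them pass.
print(f"pass@1 = {pass_at_k(10, 3, 1):.3f}")
print(f"pass@5 = {pass_at_k(10, 3, 5):.3f}")
```

The same model can therefore post very different headline numbers depending on k, which is worth checking before comparing scores across reports.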

⚠️

How Benchmarks Get Gamed

Data contamination: test set data appears in model training corpus
Distribution fine-tuning: model fine-tuned on data that mimics the benchmark's format and distribution, lifting the score without a matching gain in general capability
Prompt engineering: optimising prompts for specific test formats inflates scores
Memorisation: model recalls answers rather than reasoning to them
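Data contamination is often screened for with n-gram overlap between test items and the training corpus. The sketch below is one simple way to do that; the function names, whitespace tokenisation, and window size are assumptions, and real pipelines work over far larger corpora with hashed indexes.

```python
def ngrams(text: str, n: int) -> set:
    """All contiguous n-token windows in the text (lower-cased, whitespace-split)."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def overlap_fraction(test_item: str, corpus_docs: list, n: int = 8) -> float:
    """Fraction of the test item's n-grams that also occur in the corpus."""
    test_grams = ngrams(test_item, n)
    if not test_grams:
        return 0.0  # item shorter than the window: nothing to compare
    corpus_grams = set()
    for doc in corpus_docs:
        corpus_grams |= ngrams(doc, n)
    return len(test_grams & corpus_grams) / len(test_grams)

corpus = ["the quick brown fox jumps over the lazy dog"]
print(overlap_fraction("quick brown fox jumps high", corpus, n=3))   # 2 of 3 trigrams match
print(overlap_fraction("an entirely different sentence here", corpus, n=3))
```

A high overlap flags an item for removal or separate reporting. A low overlap does not prove the item is clean, since paraphrased or translated copies slip past exact matching.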

🎯

What Makes a Good Benchmark

Novel problems: not solvable by pattern-matching training data
Human-verifiable: ground truth is unambiguous
Diverse: many task types so no single trick wins
Dynamic: can be updated as old tasks saturate (e.g. ARC-AGI 2)

Open Problems in AI In-depth

Despite the extraordinary progress of the last decade, fundamental problems remain unsolved. These are not engineering challenges awaiting more compute — they are conceptual gaps that may require new paradigms to address. Understanding them prevents over-hyping current capabilities and points toward where research is most needed.

💬

Hallucination & Factual Reliability

LLMs generate confident falsehoods. Retrieval-augmented approaches reduce but don't solve the problem. Root cause: models are trained to produce fluent, plausible text — not verified facts. A model that reliably says "I don't know" when it doesn't know would be enormously valuable. Calibrated uncertainty is an open research problem.
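Calibration can be quantified. One common measure is expected calibration error (ECE): bucket predictions by stated confidence, then average the gap between confidence and accuracy in each bucket. The confidences and outcomes below are invented to show an overconfident model.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Weighted average of |mean confidence - accuracy| over confidence bins."""
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        index = min(int(conf * n_bins), n_bins - 1)  # confidence 1.0 goes in the top bin
        bins[index].append((conf, ok))
    total = len(confidences)
    ece = 0.0
    for bucket in bins:
        if not bucket:
            continue
        avg_conf = sum(c for c, _ in bucket) / len(bucket)
        accuracy = sum(ok for _, ok in bucket) / len(bucket)
        ece += len(bucket) / total * abs(avg_conf - accuracy)
    return ece

# Hypothetical model: claims 95% confidence on ten answers, gets six right.
overconfident = expected_calibration_error([0.95] * 10, [1] * 6 + [0] * 4)
# Hypothetical model: claims 75% confidence on four answers, gets three right.
calibrated = expected_calibration_error([0.75] * 4, [1, 1, 1, 0])
print(f"overconfident ECE = {overconfident:.2f}, calibrated ECE = {calibrated:.2f}")
```

A model that says "I don't know" appropriately would show low ECE. Confident falsehoods appear as a large gap in the high-confidence bins.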

🧩

Common Sense & Physical Reasoning

A 2-year-old knows a dropped ball falls. AI systems still fail some simple physical scenarios. LLMs absorbed much commonsense from text but miss the causal world model behind it. The WinoGrande benchmark was built to probe pronoun resolution that humans find trivial but that depends on commonsense knowledge. Physical reasoning requires causal world models, not statistical text patterns.

🗓️

Long-Horizon Planning

Current agents handle ~10–50 step tasks reliably. Real-world projects span hundreds of interdependent steps over days or weeks. Errors compound: one wrong action early invalidates all downstream planning. Agents also lack persistent memory across sessions and suffer context window limits. Autonomous multi-day task completion remains out of reach.
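The compounding effect is easy to quantify under a simplifying assumption: every step succeeds independently with the same probability and any single failure sinks the task. Real agents can sometimes recover from errors, so treat this as a pessimistic sketch; the 99% per-step figure is hypothetical.

```python
def task_success_probability(per_step_success: float, n_steps: int) -> float:
    """Probability that all n independent steps succeed."""
    return per_step_success ** n_steps

def required_step_reliability(target_success: float, n_steps: int) -> float:
    """Per-step success rate needed to hit a target end-to-end success rate."""
    return target_success ** (1.0 / n_steps)

for steps in (10, 50, 200):
    p = task_success_probability(0.99, steps)
    print(f"{steps:>3} steps at 99% per step -> {p:.1%} task success")

needed = required_step_reliability(0.90, 200)
print(f"90% success over 200 steps needs {needed:.3%} per step")
```

At 99% per-step reliability a 50-step task succeeds about 60% of the time and a 200-step task about 13% of the time. Small per-step error rates become prohibitive at long horizons.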

🔀

Causal Inference

Current AI is fundamentally correlational. "Countries with more hospitals have more disease" — a correlational model concludes hospitals cause disease. Judea Pearl's causal hierarchy (association → intervention → counterfactual) identifies exactly what's missing. Moving from correlation to causation is essential for reliable decision-making in medicine, policy, and science.
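The hospitals example can be simulated. In the sketch below a hidden common cause, population size, drives both hospital count and disease cases, and neither affects the other. All coefficients and the choice of confounder are assumptions made for illustration.

```python
import random

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5

rng = random.Random(0)
# Confounder: population. Hospitals and cases each scale with it, plus independent noise.
populations = [rng.uniform(1e6, 1e7) for _ in range(2000)]
hospitals = [p * 1e-5 * (1 + 0.1 * rng.gauss(0, 1)) for p in populations]
cases = [p * 0.02 * (1 + 0.1 * rng.gauss(0, 1)) for p in populations]

raw = pearson(hospitals, cases)
# Adjusting for the confounder: compare per-capita rates instead of raw counts.
adjusted = pearson([h / p for h, p in zip(hospitals, populations)],
                   [c / p for c, p in zip(cases, populations)])
print(f"raw correlation: {raw:.2f}, per-capita correlation: {adjusted:.2f}")
```

The raw counts are strongly correlated even though the simulation contains no causal link between them; once population is accounted for, the association vanishes. A purely correlational learner sees only the first number.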

📉

Sample Efficiency

A child learns "dog" from ~10 examples. GPT-3 was trained on roughly 300 billion tokens, and later frontier models on far more, to achieve their breadth. Humans generalise from far fewer examples via strong inductive biases built by evolution. Few-shot and meta-learning methods address part of this, but current models still require vastly more data than humans for comparable generalisation.

🎯

Robustness & Distribution Shift

Models trained on one distribution fail when deployed conditions differ. A self-driving model trained in California may fail catastrophically in snow. Adversarial examples — imperceptible perturbations — fool classifiers with >99% confidence. Consistent performance across distribution shift remains unsolved and is critical for safety-critical deployment.
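A toy illustration of an adversarial perturbation, using the fast gradient sign method (FGSM, Goodfellow et al., 2015) on a hand-set logistic classifier. The weights, input, and epsilon are hypothetical; attacks on real image classifiers follow the same recipe but take the gradient through a deep network.

```python
import math

def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))

def predict(weights, x) -> float:
    """Probability of class 1 under a bias-free logistic model."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)))

def fgsm(weights, x, label, epsilon):
    """Shift every feature by +/- epsilon in the direction that increases the loss."""
    # For logistic loss, d(loss)/d(x_i) = (prediction - label) * w_i.
    error = predict(weights, x) - label
    def sign(v):
        return (v > 0) - (v < 0)
    return [xi + epsilon * sign(error * w) for w, xi in zip(weights, x)]

# Assumed 100-dimensional model and an input it classifies confidently as class 1.
weights = [1.0 if i % 2 == 0 else -1.0 for i in range(100)]
x = [0.02 * w for w in weights]
x_adv = fgsm(weights, x, label=1, epsilon=0.05)

print(f"clean prediction:       {predict(weights, x):.3f}")
print(f"adversarial prediction: {predict(weights, x_adv):.3f}")
print(f"largest feature change: {max(abs(a - b) for a, b in zip(x_adv, x)):.2f}")
```

No single feature moves by more than 0.05, yet the prediction flips from about 0.88 to about 0.05, because in high dimensions many tiny aligned changes add up to a large shift in the logit.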

Open Problems in AI — The frontier challenges of 2024–2026

📋 Chapter 1.7 — Key Takeaways

Foundation Models are the new paradigm — one model pre-trained at scale, adapted to many tasks via prompting or fine-tuning
Neural scaling laws (Kaplan et al., 2020): loss improves predictably as a power law in compute, parameters, and data, with no hard ceiling established so far
AI already surpasses humans on standard perception and reading-comprehension benchmarks; approaches parity in reasoning and code; still behind in physical commonsense and long-horizon planning
The AI ecosystem is bifurcating: closed frontier (GPT-4o, Claude, Gemini) vs open weights (LLaMA 3, Mistral, DeepSeek) — both converging in capability
Benchmarks are routinely gamed — saturation ≠ capability solved. ARC-AGI remains the hardest cheat-resistant general reasoning eval
Hard open problems — hallucination, common-sense reasoning, long-horizon planning, causal inference, sample efficiency — are conceptual gaps, not just engineering ones

🎓 Domain 1 Complete — Foundations of AI

Ch 1.1 — AI = optimisation toward a goal. Four capabilities: Perceive, Reason, Learn, Act. All today's AI is ANI — narrow, task-specific, impressive in domain, brittle outside it.
Ch 1.2 — Two AI winters from data/compute/algorithm gaps. AlexNet (2012) + Transformer (2017) are the two true inflection points. ChatGPT (2022) = 100M users in 60 days.
Ch 1.3 — GOFAI = symbol manipulation. Expert systems worked commercially (XCON, MYCIN) but hit the knowledge bottleneck. GOFAI's failure suggested that much of intelligence must be learned, not hand-encoded.
Ch 1.4 — Chinese Room: syntax ≠ semantics. Symbol grounding problem motivates multimodal and embodied AI. Hard Problem of consciousness remains unresolved for biological and artificial systems.
Ch 1.5 — PEAS framework describes any agent. Five agent types: Reflex → Model-based → Goal-based → Utility → Learning. This maps directly to modern LLM agents in Domain 8.
Ch 1.6 — Connectionism won empirically. The frontier is neuro-symbolic: neural perception + symbolic reasoning — chain-of-thought, tool use, AlphaGo, AlphaGeometry.
Ch 1.7 — Foundation models = the new paradigm. One model, many tasks. Open problems: hallucination, common-sense, long-horizon planning, causal inference remain unsolved.

Domain 1 gave you the vocabulary and mental models. Every subsequent domain takes one part of this picture and goes deep. Domain 2 gives you the mathematics. Domain 3 the algorithms. Domain 4 the architectures. The pieces will connect — keep going.
