What AI Can and Can’t Do (Yet)

Two truths that are both real

You have already met [[artificial-intelligence|AI]] as a field, walked through its history, and seen the few big ways machines learn from data. Now comes the most important — and most abused — question of all: what can today's systems actually do, and where do they fall apart? Get this right and you become hard to fool, by hype and by fear alike.

Two truths hold at the same time, and beginners usually grab only one. The first: modern AI is genuinely, jaw-droppingly capable. It writes fluent prose, translates between languages it was barely trained on, spots tumors radiologists miss, and plays Go better than any human alive. None of that is a trick. The second: the very same systems make confident, absurd mistakes a careful child never would — and often cannot tell you they are unsure.

Pattern-matching is not understanding

Here is the single idea that explains most of AI's strange behavior. A trained model is, at bottom, an extraordinarily rich pattern-matcher. It has seen oceans of examples and learned the statistical shape of them — which pixels tend to mean "cat," which words tend to follow which. When your question lands inside that learned shape, the answer feels almost magical. When it lands just outside, the magic evaporates.

This is why a [[large-language-model|large language model]] can write you a sonnet yet insist that 9.11 is larger than 9.9. It is not reasoning about quantity the way you do; it is predicting plausible next tokens. The text *looks* like understanding because fluent language is what it was shaped to produce. Real comprehension — a stable internal model of how the world works — is something far more is claimed than can be shown.

Philosophers have argued about exactly this for decades — the Chinese Room argument asks whether manipulating symbols by rule, however convincingly, ever amounts to *meaning* them. You do not need to settle that debate. You only need the practical takeaway: a system that matches patterns brilliantly can be right for the wrong reasons, and that is what makes its errors so hard to predict.

The three classic failure modes

If pattern-matching is the cause, three failures are the symptoms you will meet again and again. Learn their names and you will start spotting them everywhere — in the news, in demos, in your own experiments.

Brittleness. A model that scores 99% on its test set can collapse on inputs that look trivially different to us. Change a few pixels invisibly and a confident "panda" becomes a confident "gibbon" — that crafted trick is an adversarial example. More mundanely, a model trained on clean daytime photos may flounder at dusk. The technical word for this fragility under shifted conditions is poor robustness, and it is the gap between a slick demo and a deployed system that survives the real world.

Bias. A model learns the world it is shown, warts and all. If the training photos of "doctor" are mostly men, the model quietly learns that correlation and reproduces it. This algorithmic bias is not the machine being malicious — it is the machine being faithful to flawed data, which is worse, because it scales the flaw and hides it behind a veneer of objectivity.

Hallucination. A generative model will, with total confidence, invent a citation, a court case, or an API that does not exist. This is hallucination, and it follows directly from how the model works: it is built to produce fluent, plausible text, and "plausible" is not the same as "true." The model has no built-in fact-checker; it does not know what it does not know. That is exactly why the fixes are grounding it in real sources and keeping a person in the loop.

Narrow brilliance, not general intelligence

Almost everything deployed today is [[narrow-ai|narrow AI]]: a system superb at one task and helpless one step outside it. The chess engine that crushes grandmasters cannot make you a coffee, or even understand the request. Today's chatbots feel broader because language touches everything — but breadth of topic is not the same as breadth of *competence*. Push on the edges and the narrowness shows.

The dream of one system that flexibly handles any task a human can — artificial general intelligence — does not exist today, and no one can honestly tell you when, or whether, it will. Be especially wary when a confident demo gets quietly extrapolated into a sweeping claim about general intelligence "arriving soon." Impressive narrow results are real; the leap to general capability is a hypothesis, not a fact.

A useful reflex: whenever you see a stunning result, ask "how narrow is the groove?" A model that hits 95% accuracy in the lab may generalize poorly the moment the real world stops resembling its training data — a shift the field calls distribution shift. The number on the slide is not a promise about your situation.

Setting expectations that survive contact

So how do you form expectations that won't shatter on first contact with reality? Not by memorizing what AI can and can't do — that list shifts every few months — but by asking the right questions of any claim. Here is a checklist you can carry into every demo and headline.

How narrow is the task? A model proven on one slice of the world rarely transfers cleanly to another. Ask what it was actually trained and tested on.
What happens when it's wrong? A wrong movie recommendation is harmless; a wrong medical or legal answer is not. Match your trust to the cost of failure.
Can the system tell you it's unsure? Most cannot, and a confident tone is not evidence of correctness. Treat fluency as style, not proof.
Is a human still in the loop? For anything that matters, the reliable pattern is the machine drafts and a person decides — not the machine deciding alone.

The pattern in step four has a name worth keeping: human-in-the-loop. The most successful real-world AI today rarely replaces a person outright. It handles the high-volume, routine fraction — a form of automation — while a human owns the judgment calls, the edge cases, and the accountability. Designing for that division of labor is usually wiser than betting the whole task on the model.

answer = model.generate(question)
if answer.is_high_stakes or answer.confidence_is_unverifiable:
    answer = human_reviews(answer)   # don't ship raw model output
act_on(answer)

The reliable shape of a real AI system: the model proposes, a human checks the consequential parts, and only then do you act.

Holding both truths at once

Hype and doom are the two easy mistakes, and they are mirror images: both treat AI as more unified and more autonomous than it is. The honest stance is harder and far more useful — hold both truths at once. These systems are genuinely powerful and genuinely limited, often in the very same breath, and the same model that drafts your email flawlessly may invent a fact on the next line.

That completes the Foundations rung. You now know what AI is, where it came from, how machines learn, the main ways they do it, and — most importantly — how to size up any AI claim with clear eyes. From here the ladder opens the hood: the small amount of math that powers learning, then data, then the algorithms themselves. You carry the most valuable habit into all of it — the instinct to ask not just "can it?" but "where does it break?"