If It Isn't Verified, It's Broken

Why verification is most of the work

Here is the number that surprises people new to chips: on a serious design, verification is usually 60–70% of the total effort — more engineers, more compute, more calendar time than the design itself. Why so lopsided? Because hardware is unforgiving. A web app ships a bug, you push a fix that afternoon. A chip ships a bug, and the fix is a new mask set, a new tape-out, and a quarter of your life gone.

So the job is not 'make it work once.' The job is to convince yourself, with evidence, that it works for every input the silicon will ever see. You cannot try them all — even a 32-bit adder has two 32-bit operands, so 2^64 input combinations — so verification is really the art of being convincingly thorough without being exhaustive: pick the right stimulus, watch the right things, and measure how much of the design you actually exercised.

The testbench

A testbench is hardware's test harness — but it's not part of the chip. Think of it as a test bench in an electronics lab: a rig that clips onto your circuit, feeds it signals, and watches what comes back. In Verilog it's a module with no ports — it sits outside the design, instantiates the thing under test (the DUT), drives its inputs, and checks its outputs.

The crucial upgrade over 'just look at the waveforms' is making the testbench self-checking: it knows the right answer ahead of time and shouts if reality disagrees. That `$error` is the whole game — it turns 'I eyeballed it and it looked okay' into 'the machine compared every result against a golden expectation and found none wrong.'

module tb;
  logic [3:0] a, b;
  logic [4:0] sum;
  adder dut (.a(a), .b(b), .sum(sum));   // DUT: the design under test

  initial begin
    a = 4'd7; b = 4'd9;                   // drive inputs
    #1;                                   // let combinational logic settle
    if (sum !== a + b)                    // compare against expected
      $error("bad sum: %0d + %0d gave %0d", a, b, sum);
    $finish;
  end
endmodule

A minimal self-checking testbench: instantiate the DUT, drive inputs, and $error if the output doesn't match the expected sum.

Directed vs constrained-random

The testbench above is a directed test: you, the engineer, hand-picked 7 + 9 because you had a hunch it mattered. Directed tests are precise and readable, and they're perfect for the cases you can name — reset behavior, the overflow boundary, that one corner you know bites. Their weakness is also their nature: you only find the bugs you already thought to look for.

Constrained-random flips it around. Instead of naming each case, you describe the legal space of inputs and let the tool fire thousands of randomized-but-valid stimuli at the design. The constraints keep the randomness sane — a valid opcode, an address in range — while the randomness reaches corners you'd never hand-pick. Run it overnight and it explores combinations no human would type.

class packet;
  rand bit [7:0] addr;
  rand bit [3:0] len;
  constraint legal { addr < 8'd200; len inside {[1:8]}; }  // stay in the legal space
endclass
// call randomize() in a loop -> thousands of valid, varied stimuli

A constrained-random stimulus: addr and len are randomized, but constraints keep every generated packet legal.

Coverage: how much did we test?

Constrained-random raises an uncomfortable question: if the tool is throwing random inputs, how do you know it actually tried the cases that matter? Maybe 100,000 random packets never once hit a length of exactly 8. Coverage is the answer — it's the scoreboard that tells you how much of the design you've actually exercised, turning 'we ran a lot of tests' into 'we hit 94% of the scenarios we said we cared about.'

There are two flavors. Code coverage is automatic and asks 'did every line, branch, and toggle get exercised?' — necessary but shallow. [[functional-coverage|Functional coverage]] is the one that earns its keep: *you* write cover points naming the situations that matter — every opcode seen, the FIFO hit both empty and full, every legal length value — and the tool tallies which ones actually occurred. It answers the real question: are we done?

covergroup cg @(posedge clk);
  coverpoint len    { bins all[] = {[1:8]}; }   // did we see every length 1..8?
  coverpoint state;                              // did we visit every FSM state?
endgroup

A functional coverage group: cover points record which lengths and which states were actually reached during the run.

Assertions: catching bugs in the act

A self-checking testbench catches a bug at the output — eventually, after the bad value has propagated out. An assertion catches it at the scene of the crime: it's a rule embedded right in the design that says 'this must always be true,' and it fires the instant it's violated, pointing straight at the offending signal and cycle.

They shine on rules that span time. A plain `if` checks one instant; an SVA (SystemVerilog Assertion) checks a relationship across cycles — 'a request must be followed by an acknowledge within three clocks,' 'this one-hot signal never has two bits set.' That `|->` reads as implies: when the left side happens, the right side must follow.

// after every request, an ack must arrive within 1 to 3 cycles
property req_gets_ack;
  @(posedge clk) req |-> ##[1:3] ack;
endproperty
assert property (req_gets_ack);

An SVA assertion: whenever req is high, ack must follow within 1 to 3 clock cycles, or the assertion fails and points at the exact cycle.

A glimpse of formal & UVM

Simulation, even with billions of random cycles, only ever tests the inputs you happened to generate — it's sampling an ocean. Formal verification takes a different route: it hands your assertions to a mathematical engine that proves them true for *all* possible inputs, or hands you a concrete counterexample. No stimulus, no coverage gaps — a proof. It doesn't scale to a whole chip, but on a tricky block (an arbiter, a FIFO, a protocol corner) it's the difference between 'we didn't see it fail' and 'it cannot fail.'

As designs grow, ad-hoc testbenches buckle. UVM (Universal Verification Methodology) is the industry's standard framework for building big ones — reusable components (drivers, monitors, scoreboards), a sequencer to generate constrained-random traffic, and a coverage model, all wired together in a way every verification team recognizes. Think of it as the scaffolding that lets a testbench grow from one engineer's script into something a whole team maintains for years.