Short answer: before you model anything, build a mass balance that attributes the gap between theoretical and actual extract across each loss point — then fix the biggest leak first. Brewhouse yield is an accounting problem before it is an AI problem.

PRODUCTION FLOWBrewhouse Yield: Using Analytics to Find Where Extract DisappearsGrainMashBoil & hopsFermentPackage
Where this sits in the beer production flow, start to finish.

Yield loss has named addresses

The gap between theoretical extract and what lands in the fermenter is not a mystery; it has specific addresses. Extract leaks at milling and crush, at lauter grain retention, into trub and dead-legs, during transfer and whirlpool, and through evaporation in the boil. Each is a real, attributable loss.

Mass-balance accounting is the discipline of assigning the missing extract to those addresses. You start from theoretical extract for the grist, subtract each measured loss, and see whether the remainder matches actual extract in the fermenter. When it reconciles, you have a true picture of where your yield goes. When it does not, the discrepancy itself is information — usually a measurement you are not taking.

Accounting before modelling

This is the data-science point worth repeating: measure first, and in this case the measurement is most of the work. A clean mass balance often answers the question outright, no model required. Weigh the grist, sample wort gravity and volume at each transfer, quantify grain-out moisture and retained extract, measure trub volume, and track evaporation. Lay those against theoretical extract and the leaks reveal themselves.

Only once the balance closes does analytics add value — spotting that lauter retention drifts with crush, or that evaporation creeps up on long-boil recipes. That is also why a yield programme depends on having collected the right data in the first place; our note on collecting your data before AI makes the case plainly. For the mash side of the same accounting, see predicting mash efficiency and extract yield.

From a balance to a ranked action list

The generative-AI angle is modest but genuinely useful. A mass balance is a table of numbers most brewers will not act on. An LLM can read that table and turn it into a ranked, plain-language action list: “Your biggest leak this quarter is lauter grain retention at 1.9% of extract — recheck crush and sparge volume first. Evaporation is second at 1.1%; trub losses are small and not worth chasing yet.” It translates accounting into priorities, with the reasoning attached, so the brewhouse fixes the largest leak rather than the most visible one.

Where it breaks

The limit is measurement honesty. A mass balance is only as good as the readings feeding it, and breweries routinely under-measure trub volume, dead-leg holdup, and grain-out extract because they are awkward to sample. If those numbers are estimates, the balance closes on false precision and the ranked action list points you at the wrong leak. Evaporation and transfer losses can also hide inside each other if your volume measurements are coarse. The fix is not a fancier model — it is better instrumentation and consistent sampling until the balance genuinely reconciles. Resist the urge to layer analytics on top of numbers that do not add up.

WHAT DRIVES ITBrewhouse Yield: Using Analytics to Find Where Extract Disappearsinput 1input 2input 3The processqualitycost / risk
What drives the process, and what it changes downstream.

The bottom line

Brewhouse yield is won with arithmetic, not algorithms. Build a mass balance that attributes lost extract to milling, lauter, trub, transfer, and evaporation; close it with honest measurement; then let a generative tool rank the leaks into an action list. Fix the biggest one, remeasure, and repeat. The AI is the messenger here — the accounting does the work.

Part of the Brewing Science & AI track. Related: collecting your data before AI.

Frequently asked questions

Is this an AI project or an accounting project? Accounting first. A mass balance that attributes extract loss across milling, lauter retention, trub and dead-legs, transfer, and evaporation tells you most of what you need. Modelling and AI come after the numbers reconcile.

What’s usually the biggest loss? It varies by brewery, which is the whole point of measuring rather than guessing. Common culprits are lauter grain retention, trub and dead-leg losses, and over-specified evaporation — but you fix the biggest leak you actually find, not the one you assume.

How accurate does my data need to be? Accurate enough to close the mass balance. If theoretical minus measured losses doesn’t reconcile with actual extract, you have a measurement gap to fix before any analytics or model is trustworthy.