Which Beer Should We Make Next? How I Let the Data Decide

Short answer: in the years I spent developing beers for AB InBev, SABMiller and United Breweries, the hardest question was never how to brew — it was which beer to brew at all. AI and analytics didn’t answer that for me, but they turned a wall of consumer surveys, retail scan data and sales history into a ranked shortlist I could actually act on. The data narrows the field; people still make the call. Here’s how the front end of new product development really works.

The front end of NPD: many noisy data sources, one screening model, a shortlist worth brewing.

The room was full of opinions, not evidence

Every new-product meeting I sat in started the same way: a dozen ideas on a whiteboard, each championed by someone senior who was sure theirs was the winner. A stronger wheat beer. A low-bitterness lager for new drinkers. A festive seasonal. Everyone had a hunch; nobody had a way to compare them.

That is the real problem at the front end of new product development. You cannot brew twelve beers to full scale to find out which one sells — each pilot costs weeks and money. You have to choose before you brew, and for years that choice was made on seniority and gut feel.

What I actually crunched

The data existed; it was just scattered and messy. My job became pulling it into one place and making it comparable. Three streams mattered most:

Retail scan and depletion data — what was actually selling, by style, pack and region, and which segments were growing quarter on quarter. This is the closest thing to truth, because it records what people bought, not what they said.
Consumer research — panels, surveys, and taste-test scores. Useful, but I learned to weight it carefully: stated preference and shopping behaviour are different animals.
Our own launch history — every beer we had released, what we forecast, and what it did. This was the most honest teacher, because it showed where our optimism had been wrong before.
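
A minimal sketch of how launch history can be turned into a check on optimism. The beer names and volumes are invented for illustration, and using the median forecast-to-actual ratio as a "haircut" is an assumption, not a method described above.

```python
from statistics import median

# Hypothetical launch history: (beer, forecast volume, actual volume), same units.
launches = [
    ("wheat_strong", 1200.0, 900.0),
    ("lager_mild", 800.0, 1000.0),
    ("festive_ale", 1500.0, 600.0),
]

def forecast_ratios(history):
    """Forecast divided by actual per launch; values above 1.0 mean over-forecasting."""
    return {name: forecast / actual for name, forecast, actual in history}

def typical_optimism(history):
    """Median forecast/actual ratio, a rough measure of how optimistic past forecasts were."""
    return median(forecast_ratios(history).values())

ratios = forecast_ratios(launches)
haircut = typical_optimism(launches)

# Deflate a new forecast by the historical optimism before comparing concepts.
new_forecast = 1000.0
adjusted_forecast = new_forecast / haircut
```

The median is used rather than the mean so that one badly missed launch does not dominate the adjustment.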

The data-science part was unglamorous and decisive: cleaning inconsistent SKU names, aligning regions, and shaping it all into features I could score. Measure first, model second — the model is only ever as good as the table you feed it.
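
The cleaning step might look something like the sketch below. The alias table, the SKU spellings and the unit counts are all hypothetical; real feeds would need a much longer mapping and manual review of anything that does not match.

```python
import re

# Hypothetical alias table: region spellings seen in different feeds -> one label.
REGION_ALIASES = {
    "n. region": "north",
    "nth": "north",
    "north": "north",
    "south zone": "south",
    "south": "south",
}

def normalize_sku(raw: str) -> str:
    """Lower-case, unify pack-size notation, collapse punctuation and spacing."""
    s = raw.strip().lower()
    s = re.sub(r"(\d+)\s*ml\b", r"\1ml", s)   # "330 ml" -> "330ml"
    s = re.sub(r"[^a-z0-9]+", " ", s)         # runs of punctuation/space -> one space
    return s.strip()

def normalize_region(raw: str) -> str:
    """Map a region spelling to its canonical label, or flag it as unknown."""
    return REGION_ALIASES.get(raw.strip().lower(), "unknown")

# Hypothetical rows from two feeds describing the same product differently.
rows = [
    ("Wheat-Beer  330 ML (Can)", "N. Region", 120),
    ("WHEAT BEER 330ml can", "north", 80),
    ("Mild Lager 650ML", "South Zone", 200),
]

totals = {}
for sku, region, units in rows:
    key = (normalize_sku(sku), normalize_region(region))
    totals[key] = totals.get(key, 0) + units
```

Unmatched regions are labelled "unknown" rather than guessed, so they surface for review instead of silently landing in the wrong bucket.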

From a wall of numbers to a ranked shortlist

Once the data was clean, the analytics were almost simple. I scored each idea against demand signals: was its segment growing, was there white space competitors hadn’t filled, did similar past launches succeed? A clustering model on consumer data helped me segment drinkers into groups that a single average had been hiding — and one of those groups was clearly under-served.
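
To make the segmentation point concrete, here is a small k-means sketch. The text does not say which clustering method was used, so k-means is an assumption, as are the two features (bitterness preference and purchases per month) and every data point.

```python
from math import dist

def kmeans(points, k, iterations=20):
    """Plain k-means: assign each point to its nearest centroid, then re-average."""
    centroids = [points[i] for i in range(k)]  # naive deterministic start: first k points
    assignment = [0] * len(points)
    for _ in range(iterations):
        assignment = [
            min(range(k), key=lambda c: dist(p, centroids[c])) for p in points
        ]
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old centroid if a cluster empties out
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return assignment, centroids

# Hypothetical drinkers: (bitterness preference 0-10, purchases per month).
drinkers = [
    (2.0, 8.0), (8.0, 7.5), (2.5, 7.0),
    (7.5, 8.5), (1.5, 7.5), (8.5, 8.0),
]

overall_bitterness = sum(p[0] for p in drinkers) / len(drinkers)
assignment, centroids = kmeans(drinkers, k=2)
```

In this toy data the overall average bitterness preference sits in the middle of the scale and describes nobody, while the two cluster centres sit at the low and high ends. That is the sense in which a single average can hide a group.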

This is where generative AI now changes the rhythm of that work. Today I’d point a language model at the messy free-text in survey responses and competitor reviews and have it summarise the recurring themes in minutes — work that used to take me days of reading. I treat its summary as a lead to verify, never as a finding, but it turns reading into triage.

The output was never “make this beer.” It was a ranked shortlist of three to five concepts with the evidence attached. That is exactly the right altitude for a model: narrow the field, then hand it to people.
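
A ranked shortlist with the evidence attached can be sketched as a weighted score over the demand signals. Everything numeric here is a hypothetical assumption: the weights, the 0-to-1 signal values and the concept names are illustrative only, and the low weight on survey appeal simply reflects the gap between stated preference and buying behaviour.

```python
from dataclasses import dataclass

@dataclass
class Concept:
    name: str
    segment_growth: float   # 0..1, is the segment growing in scan data?
    white_space: float      # 0..1, how unfilled is the space by competitors?
    analog_success: float   # 0..1, how did similar past launches do?
    survey_appeal: float    # 0..1, stated preference from research

# Hypothetical weights; behaviour-based signals outweigh stated preference.
WEIGHTS = {
    "segment_growth": 0.35,
    "white_space": 0.25,
    "analog_success": 0.30,
    "survey_appeal": 0.10,
}

def score(concept: Concept) -> float:
    """Weighted sum of the demand signals."""
    return sum(w * getattr(concept, signal) for signal, w in WEIGHTS.items())

def shortlist(concepts, top_n=3):
    """Return the top concepts with their score and the signals behind it."""
    ranked = sorted(concepts, key=score, reverse=True)[:top_n]
    return [
        {
            "name": c.name,
            "score": round(score(c), 3),
            "evidence": {signal: getattr(c, signal) for signal in WEIGHTS},
        }
        for c in ranked
    ]

concepts = [
    Concept("strong_wheat", 0.6, 0.4, 0.5, 0.9),
    Concept("mild_lager", 0.8, 0.7, 0.6, 0.5),
    Concept("festive_seasonal", 0.3, 0.5, 0.4, 0.8),
]
top = shortlist(concepts, top_n=2)
```

With these made-up numbers the concept with the highest survey appeal does not come first, because the behavioural signals carry more weight. The evidence dictionary travels with each entry so the people making the call can see why a concept ranked where it did.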

Where it breaks

I have to be honest about the failures, because the front end is where over-confidence is most expensive.

The data predicts the routine well and the rare almost never. A model trained on existing styles is blind to a genuinely new category: it had no way to see the non-alcoholic wave coming, because there was no history of it to learn from. Surveys flattered ideas that flopped at the shelf. And thin data is dangerous: feed a flexible model a handful of past launches and it will rank a dud highly with total confidence, the same trap I hit in my first demand-forecasting project. The data tells you where demand is plausible. It cannot promise a hit.
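
One common guard against the thin-data trap is to shrink small-sample success rates toward a prior. This is a general technique, not something the account above says was used, and the prior rate, prior strength and launch counts below are hypothetical.

```python
def raw_rate(successes: int, launches: int) -> float:
    """Observed success rate, taken at face value."""
    return successes / launches

def shrunk_rate(successes, launches, prior_rate=0.3, prior_strength=10):
    """Pull the observed rate toward prior_rate; fewer launches means a stronger pull.

    Equivalent to the posterior mean of a Beta prior worth `prior_strength`
    pseudo-launches with success rate `prior_rate`.
    """
    return (successes + prior_rate * prior_strength) / (launches + prior_strength)

# Hypothetical history: a style with 2 launches, both hits, versus one with 50.
thin = (2, 2)
thick = (30, 50)

raw_thin, raw_thick = raw_rate(*thin), raw_rate(*thick)
adj_thin, adj_thick = shrunk_rate(*thin), shrunk_rate(*thick)
```

Taken raw, the style with two launches looks perfect and outranks the one with fifty. After shrinkage the order flips, because two data points are not enough to move far from the prior.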

The bottom line

The front end of NPD is a filtering problem, and that is what AI and analytics are genuinely good at: taking a noisy, contradictory pile of consumer and market data and turning it into a defensible shortlist. It replaced “the loudest person in the room” with “the evidence on the table” — and that alone made the beers I went on to develop better bets. But the data narrows; it does not decide. The judgement that picks the winner from the shortlist still belongs to people who know the brand and the drinker.

Beer NPD with Data: Part 1 of 3. Next: crunching recipe and sensory data.

Frequently asked questions

Can AI decide which new beer to develop? Not decide — narrow. AI and analytics are excellent at ranking a long list of ideas against consumer, market and sales data so you develop the few with real demand behind them. The final call still belongs to people who know the brand and the market.

What data do you need before developing a new beer? Retail scan and depletion data, consumer panel and survey responses, category and competitor trends, and your own historical launch performance. The quality of the front-end decision is capped by the quality of this data, not the cleverness of the model.

Why do most new beers fail despite market research? Because surveys capture what people say, not what they buy, and because a confident model trained on thin or biased data will still rank a dud highly. The data tells you where demand is plausible; it cannot guarantee a successful launch.