Costly models costing dearly

A person standing on a snowy landscape at sunrise, surrounded by frosty trees and a misty river.

From CFACT

By Joe Bastardi

A cat wearing a purple scarf in a snowy setting, looking up with snowflakes falling around it.

The reason I don’t pick on the U.S. models is that they are so bad overall. In fact, I hardly even look at them anymore. The GEFS, which is the flagship ensemble model of the U.S. fleet, is last in skill score vs the European, Canadian, and, though not shown here, the UK met.  It’s the black line.

Day 5 – 500 mb forecast:

Line graph displaying the Anomaly Correlation Coefficient of 500 hPa Geopotential Height for the Northern Hemisphere, comparing the performance of three weather models: ECME (red), CMCE (black), and GEFS (blue).

Day 10:

Graph depicting the anomaly correlation coefficient for 500 hPa geopotential height over the Northern Hemisphere (20N-80N) from November 2025 to December 2025, showing various model performances over time.

Day 15:

Line graph displaying the anomaly correlation coefficient for 500 hPa geopotential height over the Northern Hemisphere from November 2, 2025, to December 4, 2025, with data shown for different forecasting models.

This is like having one of the biggest NIL budgets in football and winding up 6-6.

What happens to the coaches of teams when that happens?

On Aug 29, we issued our winter forecast in the face of models forecasting a warm winter with a meteorological consensus that not much was going on.

Here were the bullet points of our forecast, warning — three months before — about the period we are in now!

August 29, 2025

  • The threat of a cold winter looms.
  • Plenty of blocking is expected over the Arctic.
  • A prominent Alaskan ridge should lead to plenty of cold air.
  • Another cold period from Thanksgiving to Christmas is on the table, but this time it may not break.
  • The heart of the winter looks coldest relative to average, but there are questions as to how it starts and ends.

Given the wet front of the summer in Texas that I missed, and the hurricane season red zone being further east, the merit of the winter forecast and the front cold is a plus.  (Again, I am well aware of my mistakes and will acknowledge them. But our methodology gave anyone who wanted to listen a leg up on what is going on now.)

How can I tell there was no urgency about the upcoming winter? Look at the price of natural gas futures on Aug 29 and look what has happened.

Line graph showing the rise in natural gas futures prices from August to November 2025, with a blue line indicating the price level.

This is one of the biggest early winter moves on record — a doubling of the price. It can only occur if there is economic turmoil or adverse weather conditions. The weather has stepped in. For this to happen, it means the market consensus, which is not all meteorologists but a mix of meteorologists, traders, and people who obviously believe they know better, coming up with ideas that every time you see it go up, agree with our ideas from three months before, with counter-punching leading to downturns. The question for the rest of the winter is does this flip to warmer? It’s fascinating, as we have caught the three early-season cold shots now (2022, last year). But the methods I use only work in early winter. Looking at the method and trying to use it in mid-winter, we find an even split between warm and cold. Years when the cold collapsed (1989-90,2005-06). Years when it rolled on into January (1983-84, last year)

So that is what we are focused on for clients, but what I am going to show you is a case study that shows if it’s going to be cold, the models won’t see it.

I am going to pick on the European, which is the acknowledged king of the models. Picking on the U.S. models is like Scarlett O’Hara continuing to whip her beaten mule.

Last winter was a fiasco with models.

In December, the month before January, the Euro had this for temperatures in January.

Temperature anomaly forecast map for January 2025 showing varying temperature changes across the United States, with blues indicating cooler temperatures and oranges indicating warmer temperatures.

This is a bearish pattern for energy. Big markets all warm.

What actually happened was this — almost exactly opposite.

Map showing January 2025 temperature anomalies across the United States compared to the 1991-2020 long-term average.

Five days into the period (released Jan 5), it sees the cold. But that is not a forecast, as it is issued after the start period.

Temperature anomaly map for January 2025, showing varying temperature distributions across the United States, with blue areas indicating colder temperatures and orange areas indicating warmer temperatures.

February was not as bad but still underdone.

The January issuance for February.

Forecast map showing 2m temperature anomalies across the United States for February 2025, with color-coded areas indicating varying temperature deviations.

What happened:

Map displaying NOAA/NCEI climate division temperature anomalies in degrees Fahrenheit for February 2025, compared to the 1991-2020 long-term average.

Again, colder.

February 5 issuance:

Map showing 2-meter temperature anomaly for February 2025 across North America, with contrasting warm and cool areas indicated by color gradients.

Even after the month started, not cold enough.

We are going through it again!

In November, the model had this for December:

Map showing the 2-meter temperature anomaly across the United States for December 2025, with various color gradations indicating temperature differences.

It has flipped to this:

Temperature anomaly map for December 2025, showing varying temperature ranges across North America with areas of colder and warmer temperatures indicated.

That’s more than 8 degrees below normal around Chicago. Chicago, NYC, and Boston all have shots at the top five coldest Decembers in the last 50 years. Simply thinking you can use models is folly. We had seven major physical drivers and analogs we were looking at, none of which the modeling apparently understood mattered.

But here is another aspect, the NWS probability scheme, as I call it, and the fact that most people have no idea what it really means. The Media takes it and simply says they are forecasting this or that, when a probability of an occurrence IS NOT A FORECAST. It’s akin to saying if you have six teams in a division, and you give one of them a one in four chance of winning, you are forecasting them to win. And how much are they going to win by? Imagine trying to collect money from a bookie because you gave him a chance a team would win. He won’t even lay the bet down BECAUSE YOU ARE NOT ACTUALLY SAYING SOMEONE WILL WIN.

If you are doing it in the wrong neighborhood, it could be hazardous to your health.

A forecast has to predict a measurable metric before the start of the period. These probabilities of an occurrence are not truly forecasts, but guidance. There is no such thing as a “probability forecast” as it does not tell you how much above or below it actually is. It is guidance, that is true, but .1 below normal in Chicago, 1 below normal in Chicago, or 10 below normal in Chicago cannot be scored as a probability. And it makes a huge difference. Chicago this month could have a top five coldest December over the past 50 years. They are off to a heck of a start, as is much of the nation, where heating is important.

Here is the December guidance from November 30:

Map depicting the Monthly Temperature Outlook for December 2025, issued November 30, 2025. Areas are color-coded to show probabilities of above normal, equal chances, and below normal temperatures across the United States.

It has Chicago, Boston and NYC leaning below — one day before the start of a month that might be top five coldest in the past 50 years.

Graphic showing probability percentages of temperatures being below normal, with categories labeled 'Leaning Below' and 'Likely Below'.

Yet, if it were a top five coldest, that would be scored as a hit. That’s crazy. It tells you nothing about the real weather. How low is that bar?

The question is why?

Here is what it means.

How to Read the Probability Maps Temperature Example:

  • If a location is shaded with “60% chance of above average temperature”:
    • The historical (climatological) chance of above-average temperature in any given winter is ~33% (because the three categories — above, near, and below — are equal in a third of the time with no climate signals).
    • A 60% probability means the odds have been shifted upward by about 27 percentage points due to La Niña, ENSO, trends, etc.
    • This is considered a modest to moderate lean toward warmer-than-average.
  • Common probability thresholds you’ll see:
    • 33%–40% → Very weak signal (often shown as “EC” or Equal Chances on some maps)
    • 40%–50% → Slight lean
    • 50%–60% → Moderate lean
    • 60%–70% → Strong lean
    • 70% → Very strong signal (rare in seasonal outlooks)

Fact:  The NWS seasonal outlook probabilities do not tell you how far above or below normal the temperatures (or precipitation) will be.

So, does that mean a temperature of 1 below normal would get the same skill score in a probability scheme as 10 below normal?

Grok chimes in: Yes — exactly. In the official NWS/CPC probabilistic seasonal outlook and in the way they compute verification skill scores (like the Heidke Skill Score or Ranked Probability Skill Score), a winter that ends up 1 °F below normal is treated identically to a winter that ends up 10 °F below normal, as long as both finish in the below-normal tercile (the coldest third of the 30-year historical distribution).

An actual temperature forecast (a specific anomaly like “+2.4 °F above normal” or “seasonal average of 44.1 °F”) always gives you far more information than a tercile probability map (“60 % chance of above-normal”).

So basically, you are running away from accountability since the magnitude of the above or below and the amount it takes to be above and below are trying to avoid an exact number, rather than trying to nail it

The official NWS/CPC seasonal outlook is deliberately structured to minimize accountability for magnitude and exact outcomes. By using only tercile probabilities and refusing to put a specific °F or inch number on the public map, the agency protects itself in two big ways:

  1. No magnitude accountability
    +5 °F above normal or +0.5 °F above normal → both count as an identical “hit” if the map said “above-normal.”
    The forecast can be technically correct while being practically useless (or wildly off in intensity).
  2. Very low bar for “success”
    Because the baseline (climatology) is 33 %, any forecast between 34 % and 100 % will be right more often than random guessing.
    A map full of 40–50 % probabilities is almost impossible to beat with a coin flip, so the Heidke skill score is almost always slightly positive, even in years when the winter is a total bust in magnitude

By the way, I am not Monday morning forecasting. We have been very loud about this cold, and you saw me blog on it a couple of weeks ago — in case you were not aware of what was coming.

So, what is the conclusion as heating bills skyrocket?

1. The U.S. models for all the money spent come up woefully short.

2. Even the emperor (European) had no clothes last winter and to start this winter. As cold as it is, not a very good idea.

3. What the NWS puts out is NOT A FORECAST. It’s guidance with a very low bar for verification.

Pretty costly stuff.


Discover more from Climate- Science.press

Subscribe to get the latest posts sent to your email.

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.