
From CFACT
By Joe Bastardi

The reason I don’t pick on the U.S. models is that they are so bad overall. In fact, I hardly even look at them anymore. The GEFS, which is the flagship ensemble model of the U.S. fleet, is last in skill score vs the European, Canadian, and, though not shown here, the UK met. It’s the black line.
Day 5 – 500 mb forecast:

Day 10:

Day 15:

This is like having one of the biggest NIL budgets in football and winding up 6-6.
What happens to the coaches of teams when that happens?
On Aug 29, we issued our winter forecast in the face of models forecasting a warm winter with a meteorological consensus that not much was going on.
Here were the bullet points of our forecast, warning — three months before — about the period we are in now!
August 29, 2025
- The threat of a cold winter looms.
- Plenty of blocking is expected over the Arctic.
- A prominent Alaskan ridge should lead to plenty of cold air.
- Another cold period from Thanksgiving to Christmas is on the table, but this time it may not break.
- The heart of the winter looks coldest relative to average, but there are questions as to how it starts and ends.
Given the wet front of the summer in Texas that I missed, and the hurricane season red zone being further east, the merit of the winter forecast and the front cold is a plus. (Again, I am well aware of my mistakes and will acknowledge them. But our methodology gave anyone who wanted to listen a leg up on what is going on now.)
How can I tell there was no urgency about the upcoming winter? Look at the price of natural gas futures on Aug 29 and look what has happened.

This is one of the biggest early winter moves on record — a doubling of the price. It can only occur if there is economic turmoil or adverse weather conditions. The weather has stepped in. For this to happen, it means the market consensus, which is not all meteorologists but a mix of meteorologists, traders, and people who obviously believe they know better, coming up with ideas that every time you see it go up, agree with our ideas from three months before, with counter-punching leading to downturns. The question for the rest of the winter is does this flip to warmer? It’s fascinating, as we have caught the three early-season cold shots now (2022, last year). But the methods I use only work in early winter. Looking at the method and trying to use it in mid-winter, we find an even split between warm and cold. Years when the cold collapsed (1989-90,2005-06). Years when it rolled on into January (1983-84, last year)
So that is what we are focused on for clients, but what I am going to show you is a case study that shows if it’s going to be cold, the models won’t see it.
I am going to pick on the European, which is the acknowledged king of the models. Picking on the U.S. models is like Scarlett O’Hara continuing to whip her beaten mule.
Last winter was a fiasco with models.
In December, the month before January, the Euro had this for temperatures in January.

This is a bearish pattern for energy. Big markets all warm.
What actually happened was this — almost exactly opposite.

Five days into the period (released Jan 5), it sees the cold. But that is not a forecast, as it is issued after the start period.

February was not as bad but still underdone.
The January issuance for February.

What happened:

Again, colder.
February 5 issuance:

Even after the month started, not cold enough.
We are going through it again!
In November, the model had this for December:

It has flipped to this:

That’s more than 8 degrees below normal around Chicago. Chicago, NYC, and Boston all have shots at the top five coldest Decembers in the last 50 years. Simply thinking you can use models is folly. We had seven major physical drivers and analogs we were looking at, none of which the modeling apparently understood mattered.
But here is another aspect, the NWS probability scheme, as I call it, and the fact that most people have no idea what it really means. The Media takes it and simply says they are forecasting this or that, when a probability of an occurrence IS NOT A FORECAST. It’s akin to saying if you have six teams in a division, and you give one of them a one in four chance of winning, you are forecasting them to win. And how much are they going to win by? Imagine trying to collect money from a bookie because you gave him a chance a team would win. He won’t even lay the bet down BECAUSE YOU ARE NOT ACTUALLY SAYING SOMEONE WILL WIN.
If you are doing it in the wrong neighborhood, it could be hazardous to your health.
A forecast has to predict a measurable metric before the start of the period. These probabilities of an occurrence are not truly forecasts, but guidance. There is no such thing as a “probability forecast” as it does not tell you how much above or below it actually is. It is guidance, that is true, but .1 below normal in Chicago, 1 below normal in Chicago, or 10 below normal in Chicago cannot be scored as a probability. And it makes a huge difference. Chicago this month could have a top five coldest December over the past 50 years. They are off to a heck of a start, as is much of the nation, where heating is important.
Here is the December guidance from November 30:

It has Chicago, Boston and NYC leaning below — one day before the start of a month that might be top five coldest in the past 50 years.

Yet, if it were a top five coldest, that would be scored as a hit. That’s crazy. It tells you nothing about the real weather. How low is that bar?
The question is why?
Here is what it means.
How to Read the Probability Maps Temperature Example:
- If a location is shaded with “60% chance of above average temperature”:
- The historical (climatological) chance of above-average temperature in any given winter is ~33% (because the three categories — above, near, and below — are equal in a third of the time with no climate signals).
- A 60% probability means the odds have been shifted upward by about 27 percentage points due to La Niña, ENSO, trends, etc.
- This is considered a modest to moderate lean toward warmer-than-average.
- Common probability thresholds you’ll see:
- 33%–40% → Very weak signal (often shown as “EC” or Equal Chances on some maps)
- 40%–50% → Slight lean
- 50%–60% → Moderate lean
- 60%–70% → Strong lean
- 70% → Very strong signal (rare in seasonal outlooks)
Fact: The NWS seasonal outlook probabilities do not tell you how far above or below normal the temperatures (or precipitation) will be.
So, does that mean a temperature of 1 below normal would get the same skill score in a probability scheme as 10 below normal?
Grok chimes in: Yes — exactly. In the official NWS/CPC probabilistic seasonal outlook and in the way they compute verification skill scores (like the Heidke Skill Score or Ranked Probability Skill Score), a winter that ends up 1 °F below normal is treated identically to a winter that ends up 10 °F below normal, as long as both finish in the below-normal tercile (the coldest third of the 30-year historical distribution).
An actual temperature forecast (a specific anomaly like “+2.4 °F above normal” or “seasonal average of 44.1 °F”) always gives you far more information than a tercile probability map (“60 % chance of above-normal”).
So basically, you are running away from accountability since the magnitude of the above or below and the amount it takes to be above and below are trying to avoid an exact number, rather than trying to nail it
The official NWS/CPC seasonal outlook is deliberately structured to minimize accountability for magnitude and exact outcomes. By using only tercile probabilities and refusing to put a specific °F or inch number on the public map, the agency protects itself in two big ways:
- No magnitude accountability
+5 °F above normal or +0.5 °F above normal → both count as an identical “hit” if the map said “above-normal.”
The forecast can be technically correct while being practically useless (or wildly off in intensity). - Very low bar for “success”
Because the baseline (climatology) is 33 %, any forecast between 34 % and 100 % will be right more often than random guessing.
A map full of 40–50 % probabilities is almost impossible to beat with a coin flip, so the Heidke skill score is almost always slightly positive, even in years when the winter is a total bust in magnitude
By the way, I am not Monday morning forecasting. We have been very loud about this cold, and you saw me blog on it a couple of weeks ago — in case you were not aware of what was coming.
So, what is the conclusion as heating bills skyrocket?
1. The U.S. models for all the money spent come up woefully short.
2. Even the emperor (European) had no clothes last winter and to start this winter. As cold as it is, not a very good idea.
3. What the NWS puts out is NOT A FORECAST. It’s guidance with a very low bar for verification.
Pretty costly stuff.
Discover more from Climate- Science.press
Subscribe to get the latest posts sent to your email.
