Groundsource: Google Research turns news articles into flood data using Gemini

Google Research just dropped something I’ve been waiting for — a practical use of large language models that actually solves a real, painful data problem.

They’re calling it Groundsource, and the pitch is simple: take millions of news articles about floods, run them through Gemini, and turn that messy text into structured, geolocated, timestamped records of actual flood events. The first output is an open-access dataset of 2.6 million urban flash flood events, spanning 150+ countries from 2000 to today.

Let me be blunt: this is the kind of AI application that doesn’t make headlines but matters more than yet another chatbot demo.

The data desert problem

If you’ve ever tried to build a flood prediction model, you know the pain. Earthquakes have global sensor networks. Floods? Not so much. We’ve got satellite data, sure, but clouds block the view, satellites only pass over every few days, and they mostly catch big, slow-moving disasters. Quick flash floods — the ones that kill people in urban areas — slip through the cracks.

Existing databases like the Global Flood Database or Dartmouth Flood Observatory are useful, but they’re sparse. The UN’s GDACS system tracks about 10,000 high-impact events. That sounds like a lot until you realize you need hundreds of thousands of records to train a decent global model. 10,000 is a drop in the bucket.

Groundsource doesn’t replace those systems. It fills the gaps they leave.

How it works (the clever part)

Instead of waiting for satellites or official reports, Groundsource scrapes news articles — local newspapers, government bulletins, wire services — and feeds them into Gemini. The model extracts location, date, flood severity, and other structured fields from unstructured text.

This isn’t new in concept. People have been trying to mine news for disaster data for years. The difference here is scale and precision. Google claims the pipeline handles the noise of global media — multiple languages, varying reporting standards, conflicting accounts — and produces records with enough accuracy to be useful for modeling.

I’d love to see the false positive rate, but the paper (linked below) goes into detail on their validation methodology. They compared against known flood databases and got solid agreement.

What’s in the dataset

The flash flood dataset covers 2.6 million events. That’s two orders of magnitude more than GDACS. The temporal coverage goes back to 2000, which means you can actually train models on historical patterns, not just recent years.

They’re releasing it openly, which is the right call. Climate research is a public good, and locking this behind a paywall would defeat the purpose.

The bigger picture

Groundsource is a methodology, not a one-off dataset. Google explicitly says it could be applied to other hazards — wildfires, landslides, heatwaves. The same approach would work for any disaster that gets reported in local news but doesn’t have a dedicated sensor network.

I’d like to see them tackle earthquakes next, even though those already have good sensor coverage. The news angle could capture secondary effects that seismometers miss — building collapses, infrastructure damage, social disruption.

What I’m watching

Two things. First, how well does this data actually improve flood forecasting models? The dataset is impressive on paper, but real-world validation against actual flood events will tell the story. Second, can this methodology scale to real-time or near-real-time? If they can process news as it breaks and update the dataset daily, that changes the game from historical analysis to operational warning systems.

The paper and dataset are linked below. If you work in climate modeling, hydrology, or disaster response, this is worth your time.

Paper | Dataset | Flash floods blog | Flood forecasting initiative