Data wrangling resources

We have updated our data wrangling resources page with some additional links to data sets. If you know of any other good sources of datasets, or other data wrangling resources that might be useful, please reply here.

2 Likes

If anyone wants to play around with EDT, a timely CSV data set of UK fuel prices can be found here:

Fuel Finder - Access Latest Fuelprices

68 columns x 7,438 rows (2026-03-22).

1 Like

Something clearly wrong with the latitude and longitude of some rows:

For example, it has a Northern Ireland filling station in the Indian ocean. I think because they got the latitude and longitude the wrong way around.

A quick review in EDT :slight_smile: shows:

  • Latitudes min -55.861 and max 60.769
  • Longitudes min -8.096 and max 55.917

So lots wrong there.

Tighnabruich as an example (nice place, I nearly bought a static caravan there…) is 5.2403, but should be -5.2403.
I suspect they have some with +ves when they should be -ves, and lats and longs switched around.

I deal with marine traffic data sets quite a lot, and I get cr*p like this all the time (not just coords, but tonnage, speeds, etc.). I use 3 separate EDT files to help error check and fix them. And then a 4th to do my calcs and results.

1 Like

Also the most expensive fuel is more than 1000x as expensive as the cheapest:

I found this issues within a few minutes of loading the data into Easy Data Transform. So shame of the UK government for not doing at least some basic checks.

Perhaps the UK government needs to buy a copy of Easy Data Transform? ;0)

There is a page to report errors:
Fuel Finder - GOV.UK
And it seems to show that it’s all dependant upon forecourts reporting the correct data in the first place.

I have emailed them and suggested they buy a license. ;0)

It is poor that they don’t do some basic validation (IMO).

1 Like