Does your machine learning smell?

Martin Fowler and Kent Beck popularized the term ‘code smell’ in the book Refactoring. They were describing the subtle signs of deeper trouble in code — signs that a program’s source code might need refactoring (restructuring and rewriting). There are too many aromas to list here, but here are some examples (remember, these things are not necessarily problems in themselves, but they suggest you need to look more closely):Duplicated code.Contrived complexity (also known as showing…

Continue reading


An update on Volve

Writing about the new almost-open dataset at Groningen yesterday reminded me that things have changed a little on Equinor’s Volve dataset in Norway. Illustrating the principle that there are more ways to get something wrong than to get them right, here’s the situation there.In 2018, Equinor generously released a very large dataset from the decommissioned field Volve. The data is undoubtedly cool, but initially it was released with no licence. Later in 2018, a licence…

Continue reading


A big new almost-open dataset: Groningen

Open data enthusiasts rejoice! There’s a large new openly licensed subsurface dataset. And it’s almost awesome. Go to the dataset The dataset has been released by Dutch oil and gas operator Nederlandse Aardolie Maatschappij (NAM), which is a 50–50 joint venture between Shell and ExxonMobil. They have operated the giant Groningen gas field since 1963, producing from the Permian Rotliegend Group, a 50 to 225 metre-thick sandstone with excellent reservoir properties. The dataset consists of…

Continue reading


Rock, paper, scissors: the perfect gift

It’s winter festival time again. Pat yourself on the back for showing 2020 that it’s going to take more than a global pandemic, environmental chaos, and geopolitical instability to see you off. Hold your dear ones near and let’s all look forward to no more of whatever the heck this year was.In the meantime, maybe you’re looking for a gift to brighten the life of your favourite geologist — or maybe you want to appreciate a…

Continue reading


x lines of Python: static basemaps with contextily

Difficulty rating: Beginner Something that is often useful in planning is to have a basemap of the area in which you have data or an interest. This can be made using a number of different tools, up to and including full-fledged GIS software, but we will use Contextily for a quick static basemap using Python. Installation is as simple as using conda install contextily or pip install contextily. The steps that we want to take…

Continue reading


The deep time clock

Check out this video by a Finnish Lego engineer on the Brick Experiment Channel (BEC):This brilliant, absurd machine — which fits easily on a coffee table — made me think about geological time.Representing deep time is a classic teaching problem in geoscience. Most of them are variants of “Imagine the earth’s history compressed into 24 hours” and use a linear scale. It’s amazing how even the Cretaceous is only 25 minutes long, and humans arrived…

Continue reading


The hot rock hack happened

I was excited about the World Geothermal Congress this year. (You remember conferences — big, expensive, tiring lecture-marathons that scientists used to go to. But sometimes they were fun.)Until this year, the WGC has only happened every 5 years and we missed the last one because it was in Australia… and the 2023 edition (it’s moving to a 3-year cycle) will be in China. So this year’s event, just a stone’s throw away in Iceland, was…

Continue reading


Stratigraphic interpretation

Over the weekend, Lisa Stright posted a question on the Software Underground’s Slack: “[… W]hat are you favorite seismic attributes for analyzing subtle stratigraphic changes in vertical sections (not time or strata slices)? I’m trying to quantitatively analyze sedimentological information from seismic data and am curious about either best practices and/or useful approaches in reservoir geophysics.” I really like questions like this, because they tend to bring out a lot of useful tricks and ideas…

Continue reading


The evolution of the Software Underground

The Software Underground started as a mailing list in 2014 with maybe twenty participants, in the wake of the first geoscience hackathons. There are now more than 2,160 “rocks + computers” enthusiasts in the Underground, with about 20 currently joining every week. It’s the fastest growing digital subsurface water-cooler in the world! And the only one.The beating heart of the Software Underground is its free, open Slack chat room. Accessible to anyone, it brings this…

Continue reading


Geoscientist, challenge thyself

No costume is required for solving geocomputing kata One of the highlights of my year is the Advent of Code, a sort of advent calendar for nerds. Its creator, Eric Wastl (hear his story), releases a new puzzle every day from the 1st of the month up to Christmas day. And the productivity of the global developer community goes down 74%.Ever since the first one I tried, I’ve been wondering what geological coding challenges might…

Continue reading