First Break           EarthDoc           Learning Geoscience

EAGE AI Committee - Newsletter on AI, ML and all things Digitalization #6/2023

June-July 2023

As a group of EAGE members and volunteers, the EAGE A.I. Committee is dedicated to helping you navigate the digital world and find the bits that are most relevant to geoscientists. This month’s newsletter summarizes our key machine learning and artificial intelligence takeaways from the EAGE Annual Convention, held in Vienna in June. We hope you find some interesting reflections and links to technical contributions that we as a Committee felt were important to share.

 

You are welcome to join EAGE or renew your membership to support the work of the EAGE A.I. Community and access all the benefits offered by the Association.

EAGE Membership Benefits: Join or Renew

 

Curious to know all EAGE is doing for the digital transformation? 

Visit the EAGE Digitalization Hub

 

 

EAGE Annual Hackathon – Natural Language Processing

Summary by: Lukas Mosser

What? 

A two-day hackathon was run during the pre-conference workshops, looking at how we can use language processing to improve interactions with geoscience and engineering information. Thanks to some great work from the hackathon facilitators, all of the abstracts submitted to this year’s EAGE Annual formed the body of text that the teams had to hack on. Using some of the Large Language Models (LLMs) that have come out this year (dominated by OpenAI’s ChatGPT), all the teams were able to dig into challenges related to extracting insight from a large collection of text.

🚢 GeoExplorers – This team presented an easy-to-use open-source interface to all your structured and unstructured data! GitHub: Link to GitHub code. This team won best in show with an awesome app that you point at a folder of data and then ask, through a chatbot interface, what is in that folder! Really cool, and it would actually be useful for many.

📊 Abstracters – This team analysed this year’s almost 900 EAGE Annual abstracts and showed us the power of data analytics on documents – who is doing what, the hot topics in geoscience, and how to become better authors! GitHub: Link to GitHub code. This team won the audience award, voted for by the other teams. The team members also came from a zero-coding background and were amazed at how easy it was to learn and apply in under 24 hours!

📅 AskAetos – This team created #AskJeeves for the EAGE Annual agenda. Type in a topic or question and get back your own personalized agenda and paper suggestions (see the example below in the “Agenda Builder App”).

🏔 GeoDude – This team showed us the power of supplementing OpenAI’s ChatGPT and open-source LLMs with contextual knowledge, and when fine-tuning can help. Great GitHub repository and implementation to check out: Link to GitHub code.
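To make the idea of "supplementing an LLM with contextual knowledge" a bit more concrete, here is a minimal sketch of the retrieval step, using only the Python standard library. It is not any team's actual implementation: the function names, the toy abstracts, and the simple bag-of-words similarity are all assumptions chosen for illustration. The resulting prompt would then be sent to whichever LLM you use, which is left out here.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    shared = set(a) & set(b)
    dot = sum(a[w] * b[w] for w in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, documents, top_k=1):
    """Return the top_k documents most similar to the question."""
    q = Counter(tokenize(question))
    scored = [(cosine_similarity(q, Counter(tokenize(d))), d) for d in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in scored[:top_k]]


def build_prompt(question, documents, top_k=1):
    """Put the retrieved text in front of the question (hypothetical prompt format)."""
    context = "\n".join(retrieve(question, documents, top_k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Toy stand-ins for conference abstracts, invented for illustration only
ABSTRACTS = [
    "Channel detection in seismic data using image segmentation.",
    "Fast reservoir simulation with a reduced grid design.",
    "Well log prediction with machine learning models in production.",
]

if __name__ == "__main__":
    print(build_prompt("How can we speed up reservoir simulation?", ABSTRACTS))
```

In practice, embeddings from a language model would likely replace the word counts, but the flow (score, pick the best matches, add them to the prompt) stays the same.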

 

Why this is useful? 

There is value in understanding how well we can customize these LLMs for very specific problems – especially where a specific scientific nomenclature is used. In general, the hackathon projects showed that LLMs have great potential for everyday matters in the energy industry (help me schedule something, help me learn more about this basin, etc.). Who knows – maybe we will see improvements to the EAGE events app using some LLMs?!

 

Most of the code is available open source and free to use in the links above!

 

 

EAGE Annual AI Dedicated Session: Going Big! Scaling Machine Learning in Geoscience/Engineering Workflows

Summary by: Ashley Russell

What? 

One of the AI Special Interest Group’s responsibilities at EAGE this year was planning and hosting a dedicated session on machine learning operations (MLOps) – in other words, the ability to maintain machine learning models over time and through user feedback. I gave a short introduction on what MLOps is and why we should care about it. Maintaining machine learning models in use across different people, different assets, and different geology or facilities is new and challenging for us, and our companies have very little Google-like capability or competence, but it is one of the ways we can scale data science widely with few data scientists.

We had four technical talks, and a panel of experts then led us through a discussion.

Some key takeaways:

  • The technical presenters showed examples with well log data (various types of prediction) and with image classification. But one keen person in the audience pointed out that seismic was not covered. Yes, we are doing data science with various seismic interpretation processes, but the general agreement is that we are not as mature on MLOps with seismic yet – the format of the data is not recognized by the cloud ML services that many of us are using.
  • It appears that full automation in machine learning is not what we want. Again, the general agreement is that a human in the loop is important on top of automated CI/CD to confirm that the model is redeployed with the right prediction quality.
  • Feedback from the users of predictions is the most important part of MLOps to capture and use – otherwise you have no reason to improve or change. It is also perhaps the most difficult part to implement. So the implementation of data science, and its application to improve our work days, depends heavily on engaged colleagues!
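As a rough illustration of the last two takeaways, the sketch below combines an automated quality check with a human sign-off before redeployment, and uses captured user feedback to decide when a model needs attention. None of this comes from the talks: the class, function names, and thresholds are hypothetical, and a real setup would sit inside your CI/CD tooling.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A retrained model waiting to be deployed (hypothetical structure)."""
    name: str
    validation_score: float  # e.g. accuracy on a held-out data set


def automated_check(candidate, current_score, min_gain=0.0):
    """Automated gate: the candidate must score at least as well as production."""
    return candidate.validation_score >= current_score + min_gain


def decide_deployment(candidate, current_score, reviewer_approves):
    """Deploy only if the automated check passes AND a human reviewer approves."""
    if not automated_check(candidate, current_score):
        return "rejected: automated check failed"
    if not reviewer_approves(candidate):
        return "held: reviewer did not approve"
    return "deployed"


def needs_retraining(feedback, max_bad_fraction=0.2):
    """Flag a model when too many users marked its predictions as wrong.

    feedback: list of booleans, True meaning the user accepted the prediction.
    The 0.2 threshold is an arbitrary value picked for this example.
    """
    if not feedback:
        return False
    bad = sum(1 for accepted in feedback if not accepted)
    return bad / len(feedback) > max_bad_fraction


if __name__ == "__main__":
    candidate = Candidate("well-log-model-v2", validation_score=0.91)
    # In practice the reviewer callback would be a person looking at QC plots
    print(decide_deployment(candidate, 0.88, reviewer_approves=lambda c: True))
    print(needs_retraining([True, True, False, False]))
```

The point of the design is that the automated check can only reject, never deploy on its own; the final "yes" always comes from a person.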

 

Why this is useful? 

This is a complex system, and sharing different perspectives and solutions on operating machine learning models and incorporating feedback allows us to choose more wisely going forward.

 

 

EAGE Annual Valuable Technical Presentation: “A Machine Learning Enabled Workflow for Efficient Channel Detection from Seismic Data” – Dr. Graham Baines, Halliburton

Summary by: Ashley Russell

What? 

We can use supervised machine learning in commercial software today to extract, for example, salt or certain horizons, but this case explored how to automatically find channel bodies in seismic. The typical machine learning workflow today is to extract various sections of seismic and have an interpreter manually “paint” over the channels to mark what is channel and what is not, in order to set up a training data set – which takes a lot of time! This author tried several automated processes to do the “painting” automatically, so that all the interpreter needs to do is confirm: yes, it is a channel, or no, it is not. The confirmed result is then used as the training data for an ML algorithm that searches for channels in uninterpreted seismic. They showed two methods. The first used open-source Python code from remote sensing, designed to extract river systems from satellite images (image to right); it was not successful when applied to seismic spectral decomposition slices. The second method, using a segmentation algorithm such as Segment Anything from Meta, was far more successful. You can read the abstract here if you are an EAGE member.
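The "confirm yes or no" step can be sketched in a few lines. The example below assumes a segmentation algorithm has already split a seismic slice into numbered segments; it does not run Segment Anything or any other model, and the function name and data layout are hypothetical rather than taken from the abstract. It simply turns the interpreter's answers into a binary training mask.

```python
def build_training_mask(segment_map, confirmations):
    """Turn auto-proposed segments plus yes/no answers into a binary label mask.

    segment_map: 2D list of ints; 0 is background, other values are segment IDs
    confirmations: dict mapping segment ID -> True (channel) or False (not channel)
    Segments the interpreter has not reviewed are treated as non-channel.
    """
    return [
        [1 if seg_id != 0 and confirmations.get(seg_id, False) else 0
         for seg_id in row]
        for row in segment_map
    ]


if __name__ == "__main__":
    # Toy 3x3 slice with three proposed segments (IDs 1, 2 and 3)
    segments = [
        [0, 1, 1],
        [2, 2, 0],
        [0, 3, 3],
    ]
    # The interpreter confirmed segment 1 as channel and rejected segment 2;
    # segment 3 has not been reviewed yet
    answers = {1: True, 2: False}
    for row in build_training_mask(segments, answers):
        print(row)
```

The interpreter only ever answers one question per segment, which is where the time saving over manual "painting" comes from.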

Why this is useful? 

This talk addressed a fundamental problem in any supervised classification machine learning workflow: someone has to spend the time to label the classes in the training data, which is time-intensive and often very subjective. Using a segmentation algorithm to “mosaic” the seismic is a clever way to make labeling easy.

Image source: isikdogan/rivamap

 

 

EAGE Annual Valuable Presentation: “AI Grid Design for Fast Reservoir Simulation” – Dr. Long Nghiem, CMG

Summary by: Ashley Russell

What? 

I am not a reservoir engineer, but I am aware of the challenges arising from computing costs (on cloud, for example) for flow simulation. This was a very popular presentation in which Dr. Nghiem presented a new method for flow simulation that reduces computational time by up to 443x! It works by simplifying the 3D simulation model in several ways. First, they perform a feature reduction that takes all the parameters in the grid cells and reduces them to only 3 factors. Then they simulate only in a “strip” of grid cells between injector and producer pairs. This method is still in development, but they shared some benchmarks showing that the history-matched production differed by 1.3% from a full 3D simulation. So far they have only tested waterflood, but the results look very promising!
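To give a feel for the "strip" idea and for what a 443x speedup means in practice, here is a small sketch. How the strips are actually built in the presented method is not described here, so the geometry below is purely an assumption: a 2D grid where a cell belongs to the strip if it lies within a chosen distance of the straight line between an injector and a producer. All names and numbers are hypothetical.

```python
import math


def distance_to_segment(p, a, b):
    """Shortest distance from point p to the segment a-b (all 2D tuples)."""
    ax, ay = a
    bx, by = b
    px, py = p
    dx, dy = bx - ax, by - ay
    length_sq = dx * dx + dy * dy
    if length_sq == 0:
        return math.hypot(px - ax, py - ay)
    # Project p onto the line through a and b, then clamp to the segment ends
    t = ((px - ax) * dx + (py - ay) * dy) / length_sq
    t = max(0.0, min(1.0, t))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))


def strip_cells(nx, ny, injector, producer, half_width):
    """Grid cells (i, j) lying within half_width of the injector-producer line."""
    return [
        (i, j)
        for i in range(nx)
        for j in range(ny)
        if distance_to_segment((i, j), injector, producer) <= half_width
    ]


def runtime_after_speedup(hours, speedup):
    """Runtime in minutes after dividing the original runtime by a speedup factor."""
    return hours * 60 / speedup


if __name__ == "__main__":
    cells = strip_cells(5, 5, injector=(0, 2), producer=(4, 2), half_width=1)
    print(f"{len(cells)} of 25 cells are in the strip")
    # A hypothetical 24-hour run at the quoted maximum speedup of 443x
    print(f"{runtime_after_speedup(24, 443):.2f} minutes")
```

Fewer active cells is only one part of the reported gain; the feature reduction to 3 factors is not covered by this sketch.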

 

You can read the abstract here if you are an EAGE member.  The image shown here is from the abstract.

 

Why this is useful? 

There was a “food for thought” question from a student in the audience, who asked: “my textbooks on simulation are 30 years old – how are we using new data science methods to make it better?” Clearly, the research that Dr. Nghiem presented here is about how we make a step change to bring about the future of reservoir simulation.

 

 

More on digitalization?

 

This newsletter is edited by the EAGE A.I. Committee.

Name                   Company / Institution   Country
Jan H. van de Mortel   Independent             Netherlands
Julio Cárdenas         Sorbonne Université     France
George Ghon            Capgemini               Norway
Lukas Mosser           Aker BP                 Norway
Oleg Ovcharenko        NVIDIA                  United Arab Emirates
Nicole Grobys          DGMK                    Germany
Roderick Perez         OMV                     Austria
Surender Manral        Schlumberger            Norway

