The digital learning ecosystem An efficient management approach to capability development, delivering smarter teams, improved productivity and better business outcomes for managers.
Bridging industry with academia An immersive and collaborative learning event, using the OilSim simulator, providing highly relevant industry knowledge and soft skills.
The digital learning ecosystem Digitally and seamlessly connecting you, the learner, with pertinent learning objects and related technologies ensuring systematic, engaging and continued learning.
Industry and client recognition
Best Outreach Program Finalist: WorldOil Awards
Overall Customer Satisfaction Score
Training provider of the year: 2013, 2014 and 2015
Upstream learning simulator With more than 50,000 participants instructed in various disciplines, the data-driven OilSim runs real-world oil and gas business scenarios and technical challenges.
Develop measurable skills and capabilities
About this course
Interest in data science and machine learning is rapidly expanding, offering the promise of increased efficiency in E&P and holding the potential to analyze and extract value from vast amounts of under-utilized legacy data. Combined with petroleum geoscience and engineering domain knowledge, the key elements underlying the successful application of the technology are data, code, and algorithms. This course builds on public datasets, code examples written in Python, statistical graphics, and algorithms from popular data science packages to provide a practical introduction to the subject and its application in the E&P domain.
What you’ll learn
The course comprises a mix of lectures and hands-on computer workshops. You’ll gain a working knowledge of coding in Python. You’ll learn the tradecraft of data import and manipulation, data visualization, exploratory data analysis, and building predictive models from data. You’ll also gain a powerful working environment for data science on your own computer, which together with code examples provided by the course will give you a jump start to applying the techniques you’ll learn to your own projects. For a flavor of what you’ll learn, check out this gallery of visualization samples drawn from the course workshops.
What data sources are used?
Using real E&P data sources is an important element of the hands-on computer workshops. This course makes extensive use of open data provided by the UK Oil and Gas Authority and the UK National Data Repository. These data sources are not only typical of the challenges and complexity presented by E&P datasets, but also contain sufficient data quality issues to make them ideal for teaching the all-important skills of data cleaning and manipulation. The course makes use of well logs, tops, seismic, and production data from these sources. The data are released in the public domain and you can continue to use these sources as you gain experience after the course.
What data science tools are used?
The course introduces a data science toolkit based on Visual Studio Code from Microsoft. This free product is rapidly growing in popularity as an environment for Python coding and data science. We think this toolkit provides a best-in-class environment for learning data science and subsequently moving to work on real projects, and we provide a free extension to further enhance its data science capabilities. More information about our data science extension is available here. The toolkit components will be installed on your computer. The advantage of this approach over cloud-based platforms is that your data is never uploaded to the cloud (if security is an issue), and you will be able to continue working when offline (if internet access is an issue).
Module 1. Overview
• What is Data Science - Overview of the course, and an outline of the scope of data science.
• Data Science for E&P - Addressing the role of data science in E&P and an example application to log
data quality control and reconstruction using machine learning.
Module 2. Data Science Toolkit - Notebooks, Visualization, and Communication
• Overview of the data science toolkit.
• Hands-on workshop introducing the toolkit and getting started with Python scripts and notebooks.
• Overview of how to manage and use Python packages.
• Hands-on workshop on Python packages covering how to install and manage packages, and how to use
packages from your Python notebooks.
• Introduction to data visualization with SandDance.
• Hands-on workshop introducing SandDance for interactive data visualization using a dataset of offshore
wells from the UK Continental Shelf.
• Overview of Markdown, a lightweight markup language for adding simple formatting to plain text
documents, and documenting Python notebooks.
• Hands-on workshop on Markdown for formatting text documents and annotating Python notebooks.
Module 3. Python Fundamentals
• Python 101 - Introduction to Python fundamentals including variables, types, statements, expressions,
control flow, and functions.
• Hands-on workshop on Python 101.
• Python 102 - More Python fundamentals including modules, files and folders, data structures, and data
frames.
• Hands-on workshop on Python 102.
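As a flavor of the fundamentals listed in this module, the following minimal sketch combines variables, a function, control flow, and a dictionary. The density porosity relation is a standard petrophysical formula, but the function name, the default matrix and fluid densities, and the sample readings are illustrative assumptions, not course material.

```python
# Illustrative sketch only: names and sample values are assumptions.

def density_porosity(rho_bulk, rho_matrix=2.65, rho_fluid=1.0):
    """Estimate porosity (fraction) from a bulk density reading in g/cc."""
    return (rho_matrix - rho_bulk) / (rho_matrix - rho_fluid)

# A dictionary mapping depth (m) to bulk density (g/cc): made-up readings.
readings = {1500.0: 2.65, 1500.5: 2.32, 1501.0: 2.155}

porosity_by_depth = {}
for depth, rho in readings.items():
    phi = density_porosity(rho)
    # Control flow: clamp physically impossible values to the range 0..1.
    if phi < 0:
        phi = 0.0
    elif phi > 1:
        phi = 1.0
    porosity_by_depth[depth] = round(phi, 3)

for depth, phi in porosity_by_depth.items():
    print(f"{depth:7.1f} m  porosity = {phi:.3f}")
```

Keeping the calculation in a small function with default arguments makes it easy to reuse with a different matrix density, for example 2.71 g/cc for limestone.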
Module 4. Computational Thinking
• Introduction to Computational Thinking - the analytical and logical processes of decomposing a complex
task and expressing it in a form that can be performed by a computer.
• Hands-on workshop on Computational Thinking applied to the design and implementation of an interactive
base map for UK E&P data.
Module 5. Exploratory Data Analysis
• Exploratory Data Analysis - Introduction to the Exploratory Data Analysis process and key Python
packages for data analysis and statistical graphics.
• Hands-on workshop on exploratory data analysis - reading data, handling dates, merging datasets,
creating statistical graphics figures, exporting results.
• Statistical Graphics - Demonstration of a gallery of statistical graphics samples.
• Descriptive Statistics - Introduction to univariate and multivariate statistics.
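The date handling and descriptive statistics steps in this module can be sketched with nothing but the Python standard library. The course itself relies on dedicated data analysis packages; the version below is only a hedged illustration, and the production figures and the `pearson` helper are made up for the example.

```python
import statistics
from datetime import date

# Made-up monthly oil production records (date string, barrels per day).
records = [
    ("2020-01-01", 1200.0),
    ("2020-02-01", 1150.0),
    ("2020-03-01", 1100.0),
    ("2020-04-01", 1000.0),
    ("2020-05-01", 950.0),
]

# Handling dates: parse ISO strings into date objects.
dates = [date.fromisoformat(d) for d, _ in records]
rates = [r for _, r in records]

# Univariate statistics.
summary = {
    "count": len(rates),
    "mean": statistics.mean(rates),
    "median": statistics.median(rates),
    "stdev": statistics.stdev(rates),  # sample standard deviation
}

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5

# Bivariate statistic: correlation between elapsed days and rate.
elapsed_days = [(d - dates[0]).days for d in dates]
r = pearson(elapsed_days, rates)

print(summary)
print(f"correlation of rate with time: {r:.3f}")
```

A strongly negative correlation here simply reflects the declining rates in the made-up data.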
Module 6. Exploring E&P Data
• Well header data - Introduction to handling well header data (surface location and attributes) using the
pandas and plotly packages.
• Hands-on workshop on well header data - including import, data cleaning, date handling, posting well
data on cultural/satellite base map and visualizing historical trends.
• Production data - Introduction to handling field production data using the pandas and plotly packages.
• Hands-on workshop on field production data - including import, data cleaning, date handling, queries,
visualizing hierarchical and time series data.
• Well log data - Introduction to handling wireline logs from LAS files using the lasio, pandas, and plotly
packages.
• Hands-on workshop on well log and tops data - including LAS file import, merging tops, and data
visualization.
• Seismic data - Introduction to handling seismic SEG-Y data using the segyio and plotly packages.
• Hands-on workshop on seismic data - including SEG-Y file import, extracting binary and trace headers,
visualizing seismic trace data, and calculating seismic attributes.
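To give a sense of what LAS file import involves, here is a minimal sketch that reads a tiny LAS 2.0 fragment using only the standard library. It is not how lasio works internally and it ignores many real-world variations (wrapped mode, reading the null value from the well section, extra sections). The sample content, the `read_las` function, and the fixed null value of -999.25 are assumptions made for illustration.

```python
# Minimal sketch of reading LAS 2.0 text with the standard library only.
# Real LAS files have more variations; the sample content is made up.

SAMPLE_LAS = """\
~Version Information
 VERS.   2.0 : CWLS LOG ASCII STANDARD - VERSION 2.0
 WRAP.   NO  : ONE LINE PER DEPTH STEP
~Well Information
 NULL.   -999.25 : NULL VALUE
~Curve Information
 DEPT.M      : Measured depth
 GR  .GAPI   : Gamma ray
 RHOB.G/C3   : Bulk density
~A  DEPT   GR   RHOB
 1500.0  45.2  2.45
 1500.5  -999.25  2.47
 1501.0  60.1  2.50
"""

def read_las(text, null_value=-999.25):
    """Return {mnemonic: [values]} with null values replaced by None."""
    section = None
    mnemonics = []
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.startswith("~"):
            section = line[1:2].upper()  # V, W, C, A, ...
            continue
        if section == "C":
            # Curve lines look like "MNEM.UNIT  data : description".
            mnemonics.append(line.split(".", 1)[0].strip())
        elif section == "A":
            rows.append([float(v) for v in line.split()])
    curves = {m: [] for m in mnemonics}
    for row in rows:
        for m, v in zip(mnemonics, row):
            curves[m].append(None if v == null_value else v)
    return curves

logs = read_las(SAMPLE_LAS)
print(logs["GR"])
```

Replacing the null marker with `None` at import time is one simple form of the data cleaning that the workshops emphasize; a dedicated package handles this and the format's other edge cases far more robustly.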
Module 7. Geospatial Data
• Coordinate reference systems - Introduction to geographic and projected coordinate systems, defining a
coordinate reference system from EPSG codes, offsets between coordinate reference systems, and
transforming positions between reference systems.
• Hands-on workshop on coordinate reference systems.
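As an illustration of transforming positions between a geographic and a projected coordinate system, the sketch below converts longitude and latitude in degrees (EPSG:4326) to Web Mercator metres (EPSG:3857) and back, using the standard spherical Mercator formulas. This is a deliberately narrow example: it involves no datum shift, and transformations between other reference systems would normally be done with a dedicated geospatial package. The function names and the sample point are assumptions.

```python
import math

EARTH_RADIUS_M = 6378137.0  # WGS 84 semi-major axis, used by EPSG:3857

def lonlat_to_web_mercator(lon_deg, lat_deg):
    """Project lon/lat in degrees (EPSG:4326) to metres (EPSG:3857)."""
    x = EARTH_RADIUS_M * math.radians(lon_deg)
    y = EARTH_RADIUS_M * math.log(
        math.tan(math.pi / 4 + math.radians(lat_deg) / 2)
    )
    return x, y

def web_mercator_to_lonlat(x, y):
    """Inverse transform from metres (EPSG:3857) back to degrees."""
    lon = math.degrees(x / EARTH_RADIUS_M)
    lat = math.degrees(2 * math.atan(math.exp(y / EARTH_RADIUS_M)) - math.pi / 2)
    return lon, lat

# An arbitrary point in the North Sea (illustrative, not a real well).
x, y = lonlat_to_web_mercator(2.0, 57.0)
lon, lat = web_mercator_to_lonlat(x, y)
print(f"easting={x:.1f} m, northing={y:.1f} m")
print(f"round trip: lon={lon:.6f}, lat={lat:.6f}")
```

Checking that a forward and inverse transform returns the original position is a useful sanity test whenever coordinates are moved between reference systems.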
Module 8. Machine Learning Fundamentals
• Machine Learning - introduction to the fundamentals of machine learning including background
concepts, the different types of machine learning, and the basic workflow to build and evaluate models
from data.
• Supervised learning with regression - introduction to regression including random forest regression and
performance evaluation.
• Hands-on workshop on regression for reconstructing wireline logs.
• Unsupervised Learning - introduction to unsupervised learning for dimensionality reduction, clustering
and outlier detection.
• Hands-on workshop on dimensionality reduction for wireline logs.
• Explainable Machine Learning - introduction to explainable machine learning: techniques for looking
inside the so-called black box models of machine learning to understand why particular predictions are
made and which variables are important.
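The basic workflow of building and evaluating a model from data (split, fit, predict, score) can be sketched without any machine learning package. Note that this example swaps in a simple least-squares line in place of the random forest regression named above, purely to keep it self-contained; the synthetic data, the 80/20 split, and the helper names are all assumptions.

```python
import random

# Synthetic data: y depends linearly on x plus noise. Made up for illustration.
random.seed(42)
xs = [i / 10 for i in range(100)]
ys = [2.0 * x + 1.0 + random.gauss(0, 0.5) for x in xs]

# 1. Split into training and test sets (80/20) after shuffling.
indices = list(range(len(xs)))
random.shuffle(indices)
split = int(0.8 * len(indices))
train_idx, test_idx = indices[:split], indices[split:]

def fit_line(x, y):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def r_squared(y_true, y_pred):
    """Coefficient of determination."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot

# 2. Fit on training data only.
slope, intercept = fit_line(
    [xs[i] for i in train_idx], [ys[i] for i in train_idx]
)

# 3. Evaluate on held-out test data the model has not seen.
y_test = [ys[i] for i in test_idx]
y_pred = [slope * xs[i] + intercept for i in test_idx]
score = r_squared(y_test, y_pred)
print(f"slope={slope:.2f}, intercept={intercept:.2f}, test R^2={score:.3f}")
```

Scoring on held-out data rather than the training data is the key design choice: it estimates how the model would perform on measurements it has never seen, which is what matters when reconstructing a missing log.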
Who should attend?
This is an introductory course for reservoir geologists, reservoir geophysicists, reservoir engineers, data management, and technical staff who want to learn the key concepts of data science. By developing your data science skills you’ll be better equipped to analyze your project data, build predictive models, and apply them in your workflows. You’ll also be in a better position to evaluate and ask the right questions about the work of others, be they in-house data science specialists or external partners.
The course is at an introductory level and all subject matter will be taught from scratch. No prior experience of statistics, Python coding or machine learning is required, although some basic college-level knowledge of math and statistics is useful. Hands-on computer workshops form a significant part of this course, and participants must come equipped with a laptop computer running Windows (8, 10) or macOS (10.10 or above) with sufficient free storage (4 GB). Detailed installation instructions are provided in advance so that participants can set up their computer with the data science toolkit and course materials before the course starts. This course is available in both classroom and online formats.
Currently there are no scheduled classes for this course.