{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# 26 Working with Well Data from the Geological Survey NRW\n", "\n", "This notebook presents the extraction of borehole data (location of wells and stratigraphy) from logs provided by the Geological Survey NRW. \n", "\n", "" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Set File Paths and download Tutorial Data\n", "\n", "If you downloaded the latest `GemGIS` version from the Github repository, append the path so that the package can be imported successfully. Otherwise, it is recommended to install `GemGIS` via `pip install gemgis` and import `GemGIS` using `import gemgis as gg`. In addition, the file path to the folder where the data is being stored is set. The tutorial data is downloaded using Pooch (https://www.fatiando.org/pooch/latest/index.html) and stored in the specified folder. Use `pip install pooch` if Pooch is not installed on your system yet." ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "ExecuteTime": { "end_time": "2021-03-17T11:21:58.167948Z", "start_time": "2021-03-17T11:21:55.976221Z" } }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "WARNING (theano.configdefaults): g++ not available, if using conda: `conda install m2w64-toolchain`\n", "C:\\Users\\ale93371\\Anaconda3\\envs\\test_gempy\\lib\\site-packages\\theano\\configdefaults.py:560: UserWarning: DeprecationWarning: there is no c++ compiler.This is deprecated and with Theano 0.11 a c++ compiler will be mandatory\n", " warnings.warn(\"DeprecationWarning: there is no c++ compiler.\"\n", "WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string.\n", "WARNING (theano.tensor.blas): Using NumPy C-API based implementation for BLAS functions.\n" ] } ], "source": [ "import gemgis as gg\n", "\n", "file_path ='data/26_working_with_well_data_from_GD_NRW/'" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "ExecuteTime": { "end_time": "2021-03-17T11:21:58.230549Z", "start_time": "2021-03-17T11:21:58.218285Z" } }, "outputs": [], "source": [ "gg.download_gemgis_data.download_tutorial_data(filename=\"26_working_with_well_data_from_GD_NRW.zip\", dirpath=file_path)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Loading the Well Data\n", "\n", "The data used for GemGIS is obtained from the Geological Survey NRW. It will be used under Datenlizenz Deutschland – Namensnennung – Version 2.0 (https://www.govdata.de/dl-de/by-2-0).\n", "\n", "The PDF Files can be loaded as strings using PyPDF2. " ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "ExecuteTime": { "end_time": "2021-01-01T10:11:06.485271Z", "start_time": "2021-01-01T10:11:06.415431Z" } }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "100%|████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00, 37.16it/s]" ] }, { "name": "stdout", "output_type": "stream", "text": [ "../../../../gemgis_data/data/26_working_with_well_data_from_GD_NRW/test_data.txt successfully saved\n" ] }, { "name": "stderr", "output_type": "stream", "text": [ "\n" ] }, { "data": { "text/plain": [ "'Stammdaten - 2521/ 5631/ 1 - Bnum: 196747 . . Objekt / Name :B. 19 ESCHWEILER\\n\\n Bohrungs- / Aufschluß-Nr. :19\\n\\n Archiv-Nr. :\\n Endteufe [m] :70.30\\n\\n Stratigraphie der Endteufe :Karbon\\n . TK 25 :Eschweiler [TK 5103]\\n\\n Ort / Gemarkung :Eschweiler/Weißweiler\\n\\n GK Rechtswert/Hochwert [m] :2521370.00 / 5631910.00\\n\\n UTM East/North [m] :32310019.32 / 5633520.32\\n\\n Hoehe des Ansatzpunktes [mNN] :130.00\\n\\n Koordinatenbestimmung :ungeprüfte Angabe aus dem Bohrarch'" ] }, "execution_count": 5, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data = gg.misc.load_pdf(path=file_path + 'test_data.pdf')\n", "data[:500]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Extracting Meta Data From the Well Data\n", "\n", "The meta data or 'Stammdaten' of the wells can be extracted using ``get_meta_data_df(...)``. Any duplicate wells will be removed automatically." ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "ExecuteTime": { "end_time": "2020-12-17T10:07:42.149429Z", "start_time": "2020-12-17T10:07:42.105866Z" } }, "outputs": [ { "data": { "text/html": [ "
\n", " | Index | \n", "DABO No. | \n", "Name | \n", "Number | \n", "Depth | \n", "X | \n", "Y | \n", "Z | \n", "X_GK | \n", "Y_GK | \n", "... | \n", "Kind | \n", "Procedure | \n", "Confidentiality | \n", "Record Type | \n", "Lithlog Version | \n", "Quality | \n", "Drilling Period | \n", "Remarks | \n", "Availability Lithlog | \n", "geometry | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "GD0001 | \n", "DABO_196747 | \n", "B.19ESCHWEILER | \n", "19 | \n", "70.30 | \n", "32310019.32 | \n", "5633520.32 | \n", "130.00 | \n", "2521370.00 | \n", "5631910.00 | \n", "... | \n", "Bohrung | \n", "\n", " | vertraulich, offen nach Einzelfallprüfung; | \n", "Übertragung eines alten Archivbestandes | \n", "1 | \n", "Schichtdaten von guter Qualität; genaue strati... | \n", "\n", " | \n", " | Original-Schichtenverzeichnis liegt vor | \n", "POINT (32310019.320 5633520.320) | \n", "
1 | \n", "GD0002 | \n", "DABO_196748 | \n", "B.16ESCHWEILER | \n", "16 | \n", "37.61 | \n", "32310327.14 | \n", "5632967.35 | \n", "122.00 | \n", "2521700.00 | \n", "5631370.00 | \n", "... | \n", "Bohrung | \n", "\n", " | vertraulich, offen nach Einzelfallprüfung; | \n", "Übertragung eines alten Archivbestandes | \n", "1 | \n", "Schichtdaten von guter Qualität; genaue strati... | \n", "\n", " | \n", " | Original-Schichtenverzeichnis liegt vor | \n", "POINT (32310327.140 5632967.350) | \n", "
2 rows × 26 columns
\n", "\n", " | Index | \n", "Name | \n", "X | \n", "Y | \n", "Z | \n", "Altitude | \n", "Depth | \n", "formation | \n", "geometry | \n", "
---|---|---|---|---|---|---|---|---|---|
0 | \n", "GD0001 | \n", "B.19ESCHWEILER | \n", "32310019.32 | \n", "5633520.32 | \n", "125.30 | \n", "130.00 | \n", "70.30 | \n", "Quaternary | \n", "POINT (32310019.320 5633520.320) | \n", "
1 | \n", "GD0001 | \n", "B.19ESCHWEILER | \n", "32310019.32 | \n", "5633520.32 | \n", "66.50 | \n", "130.00 | \n", "70.30 | \n", "Miocene | \n", "POINT (32310019.320 5633520.320) | \n", "
2 | \n", "GD0001 | \n", "B.19ESCHWEILER | \n", "32310019.32 | \n", "5633520.32 | \n", "60.90 | \n", "130.00 | \n", "70.30 | \n", "Oligocene | \n", "POINT (32310019.320 5633520.320) | \n", "
3 | \n", "GD0001 | \n", "B.19ESCHWEILER | \n", "32310019.32 | \n", "5633520.32 | \n", "59.70 | \n", "130.00 | \n", "70.30 | \n", "Carboniferous | \n", "POINT (32310019.320 5633520.320) | \n", "
4 | \n", "GD0002 | \n", "B.16ESCHWEILER | \n", "32310327.14 | \n", "5632967.35 | \n", "117.80 | \n", "122.00 | \n", "37.61 | \n", "Quaternary | \n", "POINT (32310327.140 5632967.350) | \n", "
5 | \n", "GD0002 | \n", "B.16ESCHWEILER | \n", "32310327.14 | \n", "5632967.35 | \n", "84.40 | \n", "122.00 | \n", "37.61 | \n", "Miocene | \n", "POINT (32310327.140 5632967.350) | \n", "
6 | \n", "GD0002 | \n", "B.16ESCHWEILER | \n", "32310327.14 | \n", "5632967.35 | \n", "84.39 | \n", "122.00 | \n", "37.61 | \n", "Carboniferous | \n", "POINT (32310327.140 5632967.350) | \n", "