gemgis.misc.get_meta_data

gemgis.misc.get_meta_data(page: List[str]) → list

Extract the name, coordinates, and depth of a single well from one page of a borehole PDF provided by the Geological Survey NRW. The function takes the extracted page as a list of strings and returns a list containing the well name, the coordinate and depth values as floats, and further metadata as strings.

Parameters

page (List[str]) – List containing the strings of one page of the borehole PDF

Returns

data – List containing the extracted values, a mix of strings (e.g. well ID and name) and floats (e.g. depth and coordinates)

Return type

list

New in version 1.0.x.

Changed in version 1.1.7: Adapted the positions of the coordinate values.

Example

>>> # Loading libraries
>>> import gemgis as gg
>>> # `data` is assumed to be the PDF content as a single string, e.g. as returned by load_pdf
>>> # Split data (steps taken from get_meta_data_df(...))
>>> data = data.split()
>>> data = '#'.join(data)
>>> data = data.split('-Stammdaten')
>>> data = [item.split('|')[0] for item in data]
>>> data = [item.split('#') for item in data]
>>> # Filter out wells without a stratigraphic column
>>> data = [item for item in data if 'Beschreibung' in item]
>>> # Get the meta data of each well
>>> coordinates = [gg.misc.get_meta_data(page=item) for item in data]
>>> coordinates[0]
['DABO_196747', 'B.19ESCHWEILER', '19', 70.3, 32310019.32, 5633520.32, 130.0,
2521370.0, 5631910.0, 'Karbon', 'Eschweiler [TK 5103]', 'Eschweiler/Weißweiler',
'ungeprüfte Angabe aus dem Bohrarchiv', 'ungeprüfte Angabe aus dem Bohrarchiv',
'Exploration, Lagerstättenerkundung', 'Bohrung', '', 'vertraulich, offen nach Einzelfallprüfung;',
'Übertragung eines alten Archivbestandes', '1', 'Schichtdaten von guter Qualität; genaue stratigrafische
Einstufung aufgestellt', '', '', 'Original-Schichtenverzeichnis liegt vor']
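
The preprocessing in the example can be hard to follow without a PDF at hand. The sketch below runs the same string operations on a short, made-up text so that each intermediate step can be inspected. The sample string and the names `raw`, `tokens`, `chunks` and `pages` are hypothetical and do not reproduce the real layout of the Geological Survey NRW files. The sketch does not call GemGIS itself.

```python
# Hypothetical stand-in for the text of a borehole PDF with two wells.
# Only the markers used by the example ('-Stammdaten', '|', 'Beschreibung')
# are taken from the documentation; everything else is made up.
raw = (
    "Header-Stammdaten WELL_A Beschreibung Ton | footer "
    "Header-Stammdaten WELL_B | footer"
)

# 1. Split on whitespace and re-join the tokens with '#'
tokens = "#".join(raw.split())

# 2. Cut the text at every '-Stammdaten' marker
chunks = tokens.split("-Stammdaten")

# 3. Keep only the part of each chunk before the first '|'
chunks = [chunk.split("|")[0] for chunk in chunks]

# 4. Turn each chunk back into a list of tokens
pages = [chunk.split("#") for chunk in chunks]

# 5. Keep only chunks that contain the token 'Beschreibung'
pages = [page for page in pages if "Beschreibung" in page]

print(pages)
```

With this sample input, only the chunk for `WELL_A` survives the filter, because it is the only one that contains `Beschreibung`. Each surviving item is a list of strings, which matches the `List[str]` type expected by the `page` parameter. Note that splitting on `'#'` leaves empty strings at the edges of a chunk.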

See also

load_pdf

Loading PDF data as string

get_meta_data_df

Getting the meta data of wells as DataFrame

get_stratigraphic_data

Getting the stratigraphic data of a well

get_stratigraphic_data_df

Getting the stratigraphic data of wells as DataFrame