A Python package for exploratory lithology analysis
Project description
Exploratory Lithology Analysis
Analysing driller’s logs is a tedious and repetitive task in many groundwater modelling projects. Automating the process of extracting useful information from driller’s logs allows spending less time on manual data wrangling, more time on its interpretation, and enhances the reproducibility of the analysis.
This package combines features to:
perform natural language processing on lithology descriptions in the logs, to detect primary and secondary lithologies
apply supervised machine learning to interpolate lithologies across a 3D grid
visualise interactively the 3D data
An introductory sample workflow gives an overview of some of these capabilities.
License
MIT (see License.txt)
Documentation
The documentation includes the API documentation.
Some notebook tutorials are hosted in a separate github repository pyela-doc. pyela-doc includes quick setup instructions with conda.
Get a quick tour of some legacy visualisations. Visualisation features are currently being reengineered.
Installation
The quickest way with conda. We recommend installing conda packages rather than pip packages whereever possible.
on Linux:
wget https://raw.githubusercontent.com/csiro-hydrogeology/pyela/testing/configs/ela_environment.yml
my_env_name=ELA
# my_env_name=elauser
conda env create -n $my_env_name -f ./ela_environment.yml python=3.7
conda activate $my_env_name
This should be all to get a working environment. If you want to use ‘ela’ from jupyter lab:
conda install --name ${my_env_name} jupyterlab ipywidgets jupyter
jupyter-labextension install @jupyter-widgets/jupyterlab-manager
python -m ipykernel install --user --name ${my_env_name} --display-name "Py3 ELA"
Windows:
call C:\Users\xxxyyy\AppData\Local\Continuum\anaconda3\Scripts\activate.bat
REM curl should come with anaconda3. Perhaps not with miniconda though.
where curl
cd c:\tmp
curl -o ela_environment.yml https://raw.githubusercontent.com/csiro-hydrogeology/pyela/testing/configs/ela_environment.yml
set my_env_name=ELA
conda env create -n %my_env_name% -f ela_environment.yml python=3.7
conda activate %my_env_name%
This should be all to get a working environment. If you want to use 'ela' from jupyter lab:
conda install -c conda-forge --name %my_env_name% jupyterlab ipywidgets jupyter
jupyter-labextension install @jupyter-widgets/jupyterlab-manager
python -m ipykernel install --user --name %my_env_name% --display-name "Py3 ELA"
Troubleshooting
pip packages specified from the environment.yaml files may have not installed (under investigation). Check that pvgeo and “our” packages are installed e.g.
conda activate %my_env_name%
conda list | grep pvgeo
if not present:
REM make sure you have git in the PATH e.g.
set PATH=C:\Users\per202\AppData\Local\Atlassian\SourceTree\git_local\mingw32\bin\;%PATH%
where git
conda activate %my_env_name%
pip install --no-deps pvgeo
pip install -e git+https://github.com/jmp75/striplog@master#egg=striplog
pip install --no-deps -e git+https://github.com/jmp75/pyela@master#egg=ela
Manual installation
As of January 2019 ela is on pypi. While pip install ela might work on some computers, it is unlikely that all python geospatial dependencies will install. We highly recommend you set up a conda environment with all dependent packages prior to installing ela with pip or from source.
‘ela’ relies on several external packages, and some can be fiddly to install depending on the version of Python and these external packages. This section thus has fairly prescriptive instructions, given in the hope of limiting the risk of issues.
Debian packages for spatial projections
cartopy and possibly other python packages require proj4 version 4.9+ to be installed (libproj-dev). If your debian/ubuntu repo does not suffice (older versions) you may try:
sudo apt-get install -y libc6
wget http://en.archive.ubuntu.com/ubuntu/pool/universe/p/proj/proj-data_4.9.3-2_all.deb
sudo dpkg -i proj-data_4.9.3-2_all.deb
wget http://en.archive.ubuntu.com/ubuntu/pool/universe/p/proj/libproj12_4.9.3-2_amd64.deb
sudo dpkg -i libproj12_4.9.3-2_amd64.deb
wget http://en.archive.ubuntu.com/ubuntu/pool/universe/p/proj/proj-bin_4.9.3-2_amd64.deb
sudo dpkg -i proj-bin_4.9.3-2_amd64.deb
wget http://en.archive.ubuntu.com/ubuntu/pool/universe/p/proj/libproj9_4.9.2-2_amd64.deb
sudo dpkg -i libproj9_4.9.2-2_amd64.deb
wget http://en.archive.ubuntu.com/ubuntu/pool/universe/p/proj/libproj-dev_4.9.3-2_amd64.deb
sudo dpkg -i libproj-dev_4.9.3-2_amd64.deb
Installation of python packages dependencies
We recommend installing Anaconda to install dependencies. Note that I recommend to not let anaconda change your startup file and change the PATH environment. To activate Anaconda you first need: source ~/anaconda3/bin/activate. Then choose a conda environment name.
Optionally, if your anaconda installation is a bit dated, you may want to do conda update -n base conda and conda update -n base anaconda-navigator
To create the conda environment for ela on Linux:
# source ~/anaconda3/bin/activate
my_env_name=ELA
conda create --name ${my_env_name} python=3.7
conda install --name ${my_env_name} rasterio cartopy geopandas pandas nltk scikit-learn scikit-image matplotlib vtk
conda activate ${my_env_name}
On Windows, using the DOS CMD prompt, assuming you installed Anaconda in user mode.
call %userprofile%\AppData\Local\Continuum\anaconda3\Scripts\activate.bat
set my_env_name=ELA
conda create --name %my_env_name% python=3.7
REM if using conda activate %my_env_name% I seem to loose conda from the command line, so:
conda install --name %my_env_name% conda
conda install --name %my_env_name% rasterio cartopy geopandas pandas nltk scikit-learn scikit-image matplotlib vtk
conda activate %my_env_name%
At this point we have installed all the python dependencies ela needs that are available via conda.
As of writing (2019-08) conda does not have pyqt5, and no suitable version of mayavi for python3. We resort to use pip. You may want to do first:
pip install --upgrade pip
For Python 3.x one needs to install pyqt5 for mayavi, as per these instructions. As of Jan 2019 be aware that there is a known issue in mayavi visual rendering with pyqt5 as a backend on Linux and ‘ela’ is affected by this. Nevertheless this is not a commplete blocker for most ‘ela’ features so installation instructions are kept here.
pip search pyqt5 | sort -g
pip search mayavi
pip install pyqt5
pip install mayavi
For users without admin rights,
conda install pyqt5
pip install mayavi --user
At this point all mandatory dependencies for ‘ela’ are installed.
Installing ELA
There are three options to access ela:
use the latest available on pip,
clone and install with setup.py
direct import of the package directory (this is done at runtime from e.g. a notebook)
pip search ela
pip install ela
Alternatively, from source with setup.py
pip install -r requirements.txt
python setup.py install
Optional dependencies
As of 2019-05 you can find new features using deep learning for classification in the submodule ela.experiment. You will need the additional dependencies:
conda install --name ${my_env_name} gensim tensorflow keras
pip install wordcloud
If reading xls files using pandas, need pkg xlrd with e.g. conda install xlrd
using Jupyter-lab
You may use ela as you prefer; we recomment using “Jupyter Lab” to write notebooks. See the Jupyter lab doc for official information.
The following should be enough otherwise to use ‘ela’
Linux:
my_env_name=ELA
conda install --name ${my_env_name} jupyterlab ipywidgets jupyter
jupyter-labextension install @jupyter-widgets/jupyterlab-manager
python -m ipykernel install --user --name ${my_env_name} --display-name "Py3 ELA"
Windows:
set my_env_name=ELA
conda install --name %my_env_name% jupyterlab ipywidgets jupyter
jupyter-labextension install @jupyter-widgets/jupyterlab-manager
python -m ipykernel install --user --name %my_env_name% --display-name "Py3 ELA"
Known issues
3D interactive visualisation - Using mayavi 4.6+ on python 3.6+ may be visually buggy on Linux. This appears to be a low level issue running on laptop with dual Intel/NVIDIA Graphic cards.
Troubleshooting
If in a conda environment trying to use pip you get:
ModuleNotFoundError: No module named 'pip._internal'
consider:
curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py
python3 get-pip.py --force-reinstall
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ela-0.7.3.zip
.
File metadata
- Download URL: ela-0.7.3.zip
- Upload date:
- Size: 51.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0e43e2c9cfba17f29e3369542173edafb0666d7b33f4eec9a5c3e907e35c6c8c |
|
MD5 | a5bd8705b8424f36a3cdf5a2979f4222 |
|
BLAKE2b-256 | eef80171b558e5f5d724633022230f612acfcc4c0e3155d8cd71efe450f9f8a5 |
File details
Details for the file ela-0.7.3-py2.py3-none-any.whl
.
File metadata
- Download URL: ela-0.7.3-py2.py3-none-any.whl
- Upload date:
- Size: 39.0 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e42bb2efa3361c5ead4bd57ae4b7190f682b28f4d1633c359b5f6eedae6575ce |
|
MD5 | cccfca4e86b20654bc15ed61fa8a9f68 |
|
BLAKE2b-256 | 16952b30cba4a2f686868523040e1cd81656fe1c47f8d6cfe7d8695951b2a456 |