Skip to main content

A Python framework for data-driven geochemistry discovery

Project description

Geochemistry π is a Python framework for data-driven geochemistry discovery. It provides an extendable tool and one-stop shop for geochemical data analysis on tabular data. The goal of the Geochemistry π is to create a series of user-friendly and extensible products of high automation for the full cycle of geochemistry research.

First Phase

It works as a software application with a command-line interface (CLI) to automate data mining process with frequently-used machine learning algorithms, which would further lower the threshold for the geochemists.

The highlight is that through choosing simple number options, the users are able to implement a completed cycle of data mining without knowledge of SciPy, NumPy, Pandas, Scikit-learn, FLAML, Ray packages.

Its models section, shown as below, provides both supervised learning and unsupervised learning methods to process the users' data, including four types of algorithms, regression, classification, clustering, and dimensional reduction.

The activity diagram of the Geochemistry π Version 1.0.0:

The whole package is under construction and the documentation is progressively evolving.

Team Info

Leader:

Core Developers:

  • Yang Lyu (Daisy, Zhejiang University, China)
  • Jianming Zhao (Jamie, Jilin University, Changchun, China)
  • Jianhao Sun (Jin, China University of Geosciences,Wuhan, China)
  • Shengxin Wang (Samson, Lanzhou University, China)

Members:

  • Wenyu Zhao (Molly, Zhejiang University, China)
  • Fang Li (liv, Shenzhen University, China)
  • Ting Liu (Kira, Sun Yat-sen University, China)
  • Kaixin Zheng (Hayne, Sun Yat-sen University, China)
  • Aixiwake·Janganuer (Ayshuak, Sun Yat-sen University, China)
  • Parnanjan Dutta (Presidency University, Kolkata, India)
  • Bailun Jiang (EPSI / Lille University, France)
  • Yongkang Chang (Kill-virus, Langzhou University, China)
  • Xirui Zhu (Rae, University of York, United Kingdom)

Join Us :)

The recruitment of research interns is ongoing !!!

Key Point: All things are done online, remote work (*^▽^*)

What can you learn?

  • Learning the full cycle of data mining on geochemistry data, including the algorithms in regression, classification, clustering, and decomposition.
  • Learning to be a qualified Python developer, including any Python programing contents towards data mining, basic software engineering techniques like OOP developing, and cooperation tools like Git.

What can you get?

  • Research internship proof and reference letter after working for > 100 hours.
  • Chance to pay a visit to Hangzhou, China, sponsored by ZJU Earth Data.
  • Chance to be guided by the experts from IT companies in Silicon Valley and Hangzhou.
  • Bonus depending on your performance.

Current Working Pattern:

  • Online working and cooperation
  • Three weeks per working cycle -> One online meeting per working cycle

Even if you are not familiar with topics above, but if you are interested in and have plenty of time to do it. That's enough. We have a full-developed training system to help you, as a newbie of data mining or Python developer, learn steps by steps with seniors until you can make a significant contribution to our project.

More details about the project?
Please refer to:
English Page: https://person.zju.edu.cn/en/zhangzhou
Chinese Page: https://person.zju.edu.cn/zhangzhou#0

Do you want to contribute to this open-source program?
Contact with your CV: sanyhew1097618435@163.com

In-house Videos

Technical record videos are on Bilibili and Youtube synchronously while other meeting videos are internal materials.

  1. ZJU_Earth_Data Introduction (Geochemical Data, Python, Geochemistry π) - Prof. Zhang
  2. How to Collaborate and Provide Bug Report on Geochemistry π Through GitHub - Can He (Sany)
  3. How to Run Geochemistry π v1.0.0-alpha - Can He (Sany)
  4. How to Create and Use Virtual Environment on Geochemistry π - Can He (Sany)
  5. How to use Github-Desktop in conflict resolution - Qiuhao Zhao (Brad)
  6. Virtual Environment & Packages On Windows - Jianming Zhao (Jamie)
  7. Git Workflow & Coordinating Synchronization - Jianming Zhao (Jamie)

Contributors

  • Qiuhao Zhao (Brad, Zhejiang University, China)
  • Anzhou Li (Andrian, Zhejiang University, China)
  • Xunxin Liu (Tante, China University of Geosciences, Wuhan, China)
  • Xin Li (The University of Manchester, United Kingdom)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geochemistrypi-0.0.1.tar.gz (2.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geochemistrypi-0.0.1-py3-none-any.whl (609.6 kB view details)

Uploaded Python 3

File details

Details for the file geochemistrypi-0.0.1.tar.gz.

File metadata

  • Download URL: geochemistrypi-0.0.1.tar.gz
  • Upload date:
  • Size: 2.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.8

File hashes

Hashes for geochemistrypi-0.0.1.tar.gz
Algorithm Hash digest
SHA256 d4a51972ba5c50be1e638f3446344aa138bd7c16aff62891242a817c0c8bdc93
MD5 4c86ef359ee14ef8f70cba9cadd352b4
BLAKE2b-256 f4a0d87e402d4a239253aa78e015d87ab87833378b0fd603855f3741fb67b0d3

See more details on using hashes here.

File details

Details for the file geochemistrypi-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: geochemistrypi-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 609.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.8

File hashes

Hashes for geochemistrypi-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1b85418243bb4436db4c28e1a125c987956a64cb4a9cfe842f07700e532e6a2d
MD5 32284f7309a5b97093129da2ff3aa790
BLAKE2b-256 005adbd9f7a760716f8551aef9c77aa2e9cceec277291c7917c20c1a1ddf06b6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page