Skip to main content

A LangGraph/LLM‐driven EDA → AutoML → report pipeline

Project description

MI-Agent

An agentic workflow for materials-informatics (MI) engineers, built with LangGraph and powered by OpenAI models. MI-Agent codifies the end-to-end MI pipeline—data loading, merging, feature selection, EDA, AutoML baselining, hyperparameter tuning, and executive reporting—into reusable nodes orchestrated as a LangGraph. LangSmith integration tracks and visualizes your graph executions. The result? MI workflows that run in seconds instead of hours, boosting your productivity by an order of magnitude.


🚀 Why MI-Agent?

  • Agentic LangGraph design lets you hit “play” on a full MI pipeline
  • 10× faster: eliminate boilerplate and manual scripting
  • Extensible nodes: swap in your own extractors, metrics, or plots
  • LangSmith-backed for graph tracking, versioning, and observability
  • Production-ready: versionable, testable, pip-installable

🛠️ Prerequisites


Installation via pip

  1. Create & activate a conda environment

    conda create -n mi_agent python=3.10 -y
    conda activate mi_agent
    
  2. Install via pip

    pip install materials_informatics_agent
    
  3. Configure your API keys

    MI-Agent will automatically look for a file named .env in your current working directory (or any parent) and load any keys it finds.

    In the folder where you’ll run the CLI (or in any ancestor), create a file called .env containing:

    OPENAI_API_KEY=sk-…
    LANGCHAIN_API_KEY=lsv2_…
    
  4. Prepare your problem file

    MI-Agent requires a .txt file (an example is provided in the sample_problem.txt in the project root of the source code) which contains:

    • your problem description

    • relative paths to your CSV(s), including any folder prefix (e.g. data/sample_data.csv)

    Example problem.txt:

    You are tasked with predicting alloy strength from composition data...
    
    - data/sample_data_1.csv: Contains experimental results...
    - data/sample_data_2.csv: Contains formulation recipes...
    
  5. Run the agent

    Now, start the mi_agent pipeline as below:

    mi_agent --problem-file <path/to/problem.txt> --output-dir <path/to/output_dir>
    

    MI-Agent will:

    • Identify & load the CSV(s) listed in the problem file
    • Merge files if needed
    • Select target & features
    • Propose & execute EDA
    • Save all generated code (*.py) for EDA analysis and images (*.png) generated during EDA into <output_dir>
    • Run multiple ML models, select top 5, tune hyperparameters, and choose the best model
    • Generate and save a 5-page technical summary into <output_dir>
    • Log all reasoning steps to LangSmith

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

materials_informatics_agent-0.1.23.tar.gz (48.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

materials_informatics_agent-0.1.23-py3-none-any.whl (23.2 kB view details)

Uploaded Python 3

File details

Details for the file materials_informatics_agent-0.1.23.tar.gz.

File metadata

File hashes

Hashes for materials_informatics_agent-0.1.23.tar.gz
Algorithm Hash digest
SHA256 496fa604e984e7691d9339f050fb02cc97ba88aed9cab194a6be2a19cdcb14d0
MD5 4f8434db5daa162dfc93f54cf0b5401f
BLAKE2b-256 4c4055a7af454495f9231a4a078749998cac26852bb994990ef25f72d5400160

See more details on using hashes here.

File details

Details for the file materials_informatics_agent-0.1.23-py3-none-any.whl.

File metadata

File hashes

Hashes for materials_informatics_agent-0.1.23-py3-none-any.whl
Algorithm Hash digest
SHA256 d84453504a4fbbcd599b520b13054ea9e2e6247a269b876e593b43811c1d082c
MD5 990fe66431ae698de4ed41f224a67bae
BLAKE2b-256 f474266718b2f532f8eb5af9daebe399657b11b6a23b6e5e9257f5ff925dbc70

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page