Skip to main content

A LangGraph/LLM‐driven EDA → AutoML → report pipeline

Project description

MI-Agent

An agentic workflow for materials-informatics (MI) engineers, built with LangGraph and powered by OpenAI models. MI-Agent codifies the end-to-end MI pipeline—data loading, merging, feature selection, EDA, AutoML baselining, hyperparameter tuning, and executive reporting—into reusable nodes orchestrated as a LangGraph. LangSmith integration tracks and visualizes your graph executions. The result? MI workflows that run in seconds instead of hours, boosting your productivity by an order of magnitude.


🚀 Why MI-Agent?

  • Agentic LangGraph design lets you hit “play” on a full MI pipeline
  • 10× faster: eliminate boilerplate and manual scripting
  • Extensible nodes: swap in your own extractors, metrics, or plots
  • LangSmith-backed for graph tracking, versioning, and observability
  • Production-ready: versionable, testable, pip-installable

🛠️ Prerequisites


Installation via pip

  1. Create & activate a conda environment

    conda create -n mi_agent python=3.10 -y
    conda activate mi_agent
    
  2. Install via pip

    pip install materials_informatics_agent
    
  3. Configure your API keys

    MI-Agent will automatically look for a file named .env in your current working directory (or any parent) and load any keys it finds.

    In the folder where you’ll run the CLI (or in any ancestor), create a file called .env containing:

    OPENAI_API_KEY=sk-…
    LANGCHAIN_API_KEY=lsv2_…
    
  4. Prepare your problem file

    MI-Agent requires a .txt file (an example is provided in the sample_problem.txt in the project root of the source code) which contains:

    • your problem description

    • relative paths to your CSV(s), including any folder prefix (e.g. data/sample_data.csv)

    Example problem.txt:

    You are tasked with predicting alloy strength from composition data...
    
    - data/sample_data_1.csv: Contains experimental results...
    - data/sample_data_2.csv: Contains formulation recipes...
    
  5. Run the agent

    Now, start the mi_agent pipeline as below:

    mi_agent --problem-file <path/to/problem.txt> --output-dir <path/to/output_dir>
    

    MI-Agent will:

    • Identify & load the CSV(s) listed in the problem file
    • Merge files if needed
    • Select target & features
    • Propose & execute EDA
    • Save all generated code (*.py) for EDA analysis and images (*.png) generated during EDA into <output_dir>
    • Run multiple ML models, select top 5, tune hyperparameters, and choose the best model
    • Generate and save a 5-page technical summary into <output_dir>
    • Log all reasoning steps to LangSmith

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

materials_informatics_agent-0.1.24.tar.gz (424.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

materials_informatics_agent-0.1.24-py3-none-any.whl (23.8 kB view details)

Uploaded Python 3

File details

Details for the file materials_informatics_agent-0.1.24.tar.gz.

File metadata

File hashes

Hashes for materials_informatics_agent-0.1.24.tar.gz
Algorithm Hash digest
SHA256 7fd7e9224d595550cbe0e479eed35b436fb584d188debf0c9f469fbcd6ad3000
MD5 456ad4bf077037bf4017f702cbd7c1b2
BLAKE2b-256 fda04ac93f51a9ee1092035a594816b3096a5a8edd5a82f3c611ab2d76349840

See more details on using hashes here.

File details

Details for the file materials_informatics_agent-0.1.24-py3-none-any.whl.

File metadata

File hashes

Hashes for materials_informatics_agent-0.1.24-py3-none-any.whl
Algorithm Hash digest
SHA256 7aaaf96f1f67636ace8f502cbc549d0405db89803fe591b63fc76c836f0a79a8
MD5 2f2a400bfede5d677bd74a0c5ecbc67f
BLAKE2b-256 50f97720c89dd82040137c47ebd5a749bbecbfed9f5ad5c6b99bca104a603a36

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page