
A LangGraph/LLM‐driven EDA → AutoML → report pipeline


MI-Agent

An agentic workflow for materials-informatics (MI) engineers, built with LangGraph and powered by OpenAI models. MI-Agent codifies the end-to-end MI pipeline (data loading, merging, feature selection, EDA, AutoML baselining, hyperparameter tuning, and executive reporting) into reusable nodes orchestrated as a LangGraph graph. LangSmith integration tracks and visualizes each graph execution. The result: MI workflows that run in seconds instead of hours, an order-of-magnitude productivity gain.


🚀 Why MI-Agent?

  • Agentic LangGraph design lets you hit “play” on a full MI pipeline
  • 10× faster: eliminate boilerplate and manual scripting
  • Extensible nodes: swap in your own extractors, metrics, or plots
  • LangSmith-backed for graph tracking, versioning, and observability
  • Production-ready: versionable, testable, pip-installable

🛠️ Prerequisites


Installation via pip

  1. Create & activate a conda environment

    conda create -n mi_agent python=3.10 -y
    conda activate mi_agent
    
  2. Install via pip

    pip install materials_informatics_agent
    
  3. Configure your API keys

    MI-Agent automatically looks for a file named .env in your current working directory (or any parent directory) and loads the keys it finds.

    In the folder where you will run the CLI (or in any ancestor folder), create a .env file containing:

    OPENAI_API_KEY=sk-…
    LANGCHAIN_API_KEY=lsv2_…
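
    If the agent fails to pick up your keys, it can help to check which .env file a parent-directory search would find. The following is a minimal standard-library sketch of such a lookup, not MI-Agent's actual loader; the helper names find_env_file, parse_env_file, and missing_keys are hypothetical, and the parser only handles plain KEY=VALUE lines.

```python
import os
from pathlib import Path
from typing import Optional


def find_env_file(start: Path) -> Optional[Path]:
    """Return the nearest .env at or above `start`, or None if there is none."""
    start = start.resolve()
    for folder in (start, *start.parents):
        candidate = folder / ".env"
        if candidate.is_file():
            return candidate
    return None


def parse_env_file(path: Path) -> dict:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for raw in path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(start: Path,
                 required=("OPENAI_API_KEY", "LANGCHAIN_API_KEY")) -> list:
    """List required keys set neither in the environment nor in the .env file."""
    env_file = find_env_file(start)
    from_file = parse_env_file(env_file) if env_file else {}
    return [k for k in required
            if not os.environ.get(k) and not from_file.get(k)]


if __name__ == "__main__":
    print("Nearest .env:", find_env_file(Path.cwd()))
    print("Missing keys:", missing_keys(Path.cwd()))
```

    Run it from the folder where you plan to invoke the CLI; an empty "Missing keys" list means both keys are available from at least one of the two sources.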
    
  4. Prepare your problem file

    MI-Agent requires a plain-text (.txt) problem file; an example, sample_problem.txt, is provided in the project root of the source code. The file contains:

    • your problem description

    • relative paths to your CSV(s), including any folder prefix (e.g. data/sample_data.csv)

    Example problem.txt:

    You are tasked with predicting alloy strength from composition data...
    
    - data/sample_data_1.csv: Contains experimental results...
    - data/sample_data_2.csv: Contains formulation recipes...
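
    Because the CSV paths are relative, a wrong folder prefix is an easy mistake to make. As a rough pre-flight check, the sketch below extracts the paths from bullet lines shaped like the example above and reports any that do not exist. It assumes the "- path.csv: description" line shape and takes the base directory as a parameter, since the README does not say which folder the paths are resolved against; listed_csv_paths and missing_csvs are hypothetical helpers, not part of MI-Agent.

```python
import re
from pathlib import Path

# Assumed line shape, copied from the example: "- data/file.csv: description".
# Paths containing spaces are not matched by this pattern.
CSV_LINE = re.compile(r"^\s*[-•]\s*(?P<path>\S+\.csv)\s*:", re.IGNORECASE)


def listed_csv_paths(problem_text: str) -> list:
    """Extract CSV paths from the bullet lines of a problem file."""
    paths = []
    for line in problem_text.splitlines():
        match = CSV_LINE.match(line)
        if match:
            paths.append(match.group("path"))
    return paths


def missing_csvs(problem_file: Path, base_dir: Path) -> list:
    """Return listed CSV paths that do not exist relative to `base_dir`."""
    text = problem_file.read_text(encoding="utf-8")
    return [p for p in listed_csv_paths(text)
            if not (base_dir / p).is_file()]


if __name__ == "__main__":
    problem = Path("problem.txt")
    if problem.is_file():
        # Assumption: paths are resolved against the current directory.
        print("Missing CSVs:", missing_csvs(problem, Path.cwd()))
```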
    
  5. Run the agent

    Start the pipeline with the mi_agent command:

    mi_agent --problem-file <path/to/problem.txt> --output-dir <path/to/output_dir>
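
    If you want to launch runs from a script or notebook rather than a shell, one option is to wrap the same command with subprocess. This is a sketch that uses only the two flags shown above; build_command and run_agent are hypothetical helper names, and treating a non-zero exit code as failure is the usual CLI convention rather than documented MI-Agent behavior.

```python
import subprocess
from pathlib import Path


def build_command(problem_file: Path, output_dir: Path) -> list:
    """Build the argument list for the mi_agent CLI invocation shown above."""
    return [
        "mi_agent",
        "--problem-file", str(problem_file),
        "--output-dir", str(output_dir),
    ]


def run_agent(problem_file: Path, output_dir: Path) -> int:
    """Run the pipeline and return the process exit code."""
    if not problem_file.is_file():
        raise FileNotFoundError(f"Problem file not found: {problem_file}")
    # check=False: return the exit code instead of raising on failure.
    completed = subprocess.run(build_command(problem_file, output_dir),
                               check=False)
    return completed.returncode


if __name__ == "__main__":
    code = run_agent(Path("problem.txt"), Path("output"))
    print("mi_agent exited with code", code)
```

    Passing the arguments as a list avoids shell quoting problems when a path contains spaces.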
    

    MI-Agent will:

    • Identify & load the CSV(s) listed in the problem file
    • Merge files if needed
    • Select target & features
    • Propose & execute EDA
    • Save the generated EDA code (*.py) and the images it produces (*.png) into <output_dir>
    • Run multiple ML models, select the top 5, tune their hyperparameters, and choose the best model
    • Generate and save a 5-page technical summary into <output_dir>
    • Log all reasoning steps to LangSmith
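
    After a run finishes, a quick inventory of <output_dir> shows what the steps above produced. The sketch below groups files by extension, using the *.py and *.png patterns named in the list; the report's file format is not stated here, so it falls into the "other" group. inventory_outputs is a hypothetical helper, and the recursive search is an assumption in case files are written into subfolders.

```python
from pathlib import Path


def inventory_outputs(output_dir: Path) -> dict:
    """Group files under `output_dir` into generated code, images, and other."""
    groups = {"code": [], "images": [], "other": []}
    for path in sorted(output_dir.rglob("*")):
        if not path.is_file():
            continue
        suffix = path.suffix.lower()
        if suffix == ".py":
            groups["code"].append(path.name)
        elif suffix == ".png":
            groups["images"].append(path.name)
        else:
            groups["other"].append(path.name)
    return groups


if __name__ == "__main__":
    out = Path("output")
    if out.is_dir():
        for kind, names in inventory_outputs(out).items():
            print(f"{kind}: {len(names)} file(s)")
            for name in names:
                print("  ", name)
```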
