Skip to main content

A LangGraph/LLM‐driven EDA → AutoML → report pipeline

Project description

MI-Agent

An agentic workflow for materials-informatics (MI) engineers, built with LangGraph and powered by OpenAI models. MI-Agent codifies the end-to-end MI pipeline—data loading, merging, feature selection, EDA, AutoML baselining, hyperparameter tuning, and executive reporting—into reusable nodes orchestrated as a LangGraph. LangSmith integration tracks and visualizes your graph executions. The result? MI workflows that run in seconds instead of hours, boosting your productivity by an order of magnitude.


🚀 Why MI-Agent?

  • Agentic LangGraph design lets you hit “play” on a full MI pipeline
  • 10× faster: eliminate boilerplate and manual scripting
  • Extensible nodes: swap in your own extractors, metrics, or plots
  • LangSmith-backed for graph tracking, versioning, and observability
  • Production-ready: versionable, testable, pip-installable

🛠️ Prerequisites


Installation via pip

  1. Create & activate a conda environment

    conda create -n mi_agent python=3.10 -y
    conda activate mi_agent
    
  2. Install via pip

    pip install materials_informatics_agent
    
  3. Configure your API keys

    MI-Agent will automatically look for a file named .env in your current working directory (or any parent) and load any keys it finds.

    In the folder where you’ll run the CLI (or in any ancestor), create a file called .env containing:

    OPENAI_API_KEY=sk-…
    LANGCHAIN_API_KEY=lsv2_…
    
  4. Prepare your problem file

    MI-Agent requires a .txt file (an example is provided in the sample_problem.txt in the project root of the source code) which contains:

    • your problem description

    • relative paths to your CSV(s), including any folder prefix (e.g. data/sample_data.csv)

    Example problem.txt:

    You are tasked with predicting alloy strength from composition data...
    
    - data/sample_data_1.csv: Contains experimental results...
    - data/sample_data_2.csv: Contains formulation recipes...
    
  5. Run the agent

    Now, start the mi_agent pipeline as below:

    mi_agent --problem-file <path/to/problem.txt> --output-dir <path/to/output_dir>
    

    MI-Agent will:

    • Identify & load the CSV(s) listed in the problem file
    • Merge files if needed
    • Select target & features
    • Propose & execute EDA
    • Save all generated code (*.py) for EDA analysis and images (*.png) generated during EDA into <output_dir>
    • Run multiple ML models, select top 5, tune hyperparameters, and choose the best model
    • Generate and save a 5-page technical summary into <output_dir>
    • Log all reasoning steps to LangSmith

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

materials_informatics_agent-0.1.17.tar.gz (48.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

materials_informatics_agent-0.1.17-py3-none-any.whl (22.9 kB view details)

Uploaded Python 3

File details

Details for the file materials_informatics_agent-0.1.17.tar.gz.

File metadata

File hashes

Hashes for materials_informatics_agent-0.1.17.tar.gz
Algorithm Hash digest
SHA256 47b183ecb111ba324e9932dfa097c6b788c42b883773f28c4991db29f0069ece
MD5 53245d661c6cb59f98e08f23649aa2f1
BLAKE2b-256 229a97dc5d0dfa47a2210075c7fd79d9c2e671a6877785452fca10519a34c107

See more details on using hashes here.

File details

Details for the file materials_informatics_agent-0.1.17-py3-none-any.whl.

File metadata

File hashes

Hashes for materials_informatics_agent-0.1.17-py3-none-any.whl
Algorithm Hash digest
SHA256 99f872695bcb421d3136b4e2b014b8d6912c9bc5c3299a637a870b35e52eb733
MD5 2d01c998034413bba718afbf6c54c0ea
BLAKE2b-256 3d66f93bdb2396338250ad3930b2813cebe71689181b6ebdc1d3101ca18feaa2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page