NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic Reasoning
This repo is the official implementation of the ICCV 2025 paper NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic Reasoning.
Release
- [2025/06/28] 🔥 NAVER code is open-sourced on GitHub.
- [2025/06/25] 🎉 The NAVER paper is accepted to ICCV 2025.
TODOs
We're working on the following TODOs:
- GUI demo.
- Support more LLMs.
- Video demo & slide presentation.
Docker (GUI Demo)
We provide a Docker image for the GUI demo.
docker run --runtime=nvidia --gpus=all -p <GUI-PORT>:8000 -e OPENAI_API_KEY=<OPENAI-API-KEY> -e AZURE_OPENAI_URL=<AZURE-OPENAI-URL> controlnet/naver:latest
The GUI will be available at http://0.0.0.0:<GUI-PORT>. See the Inference with GUI section below for more information.
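For example, a minimal run that exposes the GUI on host port 8080 with an OpenAI key (illustrative values; drop the AZURE_OPENAI_URL variable if you are not using Azure OpenAI):

# Illustrative values: host port 8080, OpenAI backend only.
docker run --runtime=nvidia --gpus=all -p 8080:8000 -e OPENAI_API_KEY=your-api-key controlnet/naver:latest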
Installation
Requirements
- Python >= 3.10
- conda
Please follow the instructions below to install the required packages and set up the environment.
1. Clone this repository.
git clone https://github.com/ControlNet/NAVER
2. Setup conda environment and install dependencies.
Option 1: Using pixi (recommended):
pixi install
pixi shell
Option 2: Building from source (you may need to set up CUDA and PyTorch manually):
conda install conda-forge/label/rust_dev::rust=1.78 -c conda-forge -y
pip install "git+https://github.com/scallop-lang/scallop.git@f8fac18#egg=scallopy&subdirectory=etc/scallopy"
pip install -e .
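If you do need to install PyTorch manually for Option 2, a typical command looks like the following (a sketch only; pick the wheel index matching your local CUDA version, here assumed to be CUDA 12.1):

# Example only: install a CUDA 12.1 build of PyTorch before running pip install -e .
# Adjust the index URL (e.g. cu118, cu121) to match your CUDA version.
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121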
3. Configure the environment
Edit the .env file, or set the variables in your shell, to configure the environment.
OPENAI_API_KEY=your-api-key # if you want to use OpenAI LLMs
AZURE_OPENAI_URL= # if you want to use Azure OpenAI LLMs
OLLAMA_HOST=http://ollama.server:11434 # if you want to use your Ollama server for Llama or DeepSeek
# do not change this TORCH_HOME variable
TORCH_HOME=./pretrained_models
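Equivalently, the same variables can be set in the shell instead of .env, for example:

# OpenAI example; export AZURE_OPENAI_URL or OLLAMA_HOST instead if you use those backends.
export OPENAI_API_KEY=your-api-key
export TORCH_HOME=./pretrained_models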
4. Download the pretrained models
Run the script below to download the pretrained models to the ./pretrained_models directory.
python -m hydra_vl4ai.download_model --base_config config/refcoco.yaml --model_config config/model_config.yaml --extra_packages naver.tool
Inference
You may need about 28 GB of VRAM to run NAVER. Consider editing ./config/model_config.yaml to load the models on multiple GPUs.
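Which GPUs the process can see is controlled by the standard CUDA_VISIBLE_DEVICES variable (plain PyTorch/CUDA behavior, independent of NAVER); the per-model device placement itself is configured in ./config/model_config.yaml. For example, where ... stands for the usual arguments shown in the sections below:

# Make GPUs 0 and 1 visible so models placed on cuda:0 and cuda:1 can be split across them.
CUDA_VISIBLE_DEVICES=0,1 python demo_cli.py ...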
Inference with GUI
You need Node.js and npm to run the GUI demo; the frontend will be compiled and built automatically.
The GUI will be available at http://0.0.0.0:8000.
python demo_gui.py \
--base_config <YOUR-CONFIG-DIR> \
--model_config <MODEL-CONFIG-PATH>
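For example, using the configs shipped with this repo (the same paths used in the model download step above):

python demo_gui.py \
--base_config config/refcoco.yaml \
--model_config config/model_config.yaml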
Inference with a single image and query
python demo_cli.py \
--image <IMAGE_PATH> \
--query <QUERY> \
--base_config <YOUR-CONFIG-DIR> \
--model_config <MODEL-CONFIG-PATH>
The result will be printed in the console.
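A concrete invocation might look like this (the image path and query are illustrative only; configs as in the download step):

# Hypothetical image and query.
python demo_cli.py \
--image ./example.jpg \
--query "the red cup on the table" \
--base_config config/refcoco.yaml \
--model_config config/model_config.yaml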
Inference on a dataset
python main.py \
--data_root <YOUR-DATA-ROOT> \
--base_config <YOUR-CONFIG-DIR> \
--model_config <MODEL-CONFIG-PATH>
The inference results are then saved in the ./result directory for evaluation.
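For example, with an illustrative data root and the configs from the download step:

# Hypothetical data root.
python main.py \
--data_root ./data \
--base_config config/refcoco.yaml \
--model_config config/model_config.yaml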
Evaluation
python evaluate.py --input <RESULT_JSONL_PATH>
The evaluation results will be printed in the console. Note that LLM output is non-deterministic, so the evaluation results may differ slightly from those in the paper.
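For example, assuming the inference step produced a file named ./result/refcoco.jsonl (the actual filename depends on your run):

# Hypothetical result path from the inference step above.
python evaluate.py --input ./result/refcoco.jsonl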
Citation
If you find this work useful for your research, please consider citing it:
@article{cai2025naver,
title = {NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic Reasoning},
author = {Cai, Zhixi and Ke, Fucai and Jahangard, Simindokht and Garcia de la Banda, Maria and Haffari, Reza and Stuckey, Peter J. and Rezatofighi, Hamid},
journal = {arXiv preprint arXiv:2502.00372},
year = {2025},
}