Skip to main content
Avatar for Luca Soldaini from gravatar.com

Luca Soldaini

Username    soldni
Date joined   Joined

18 projects

dolma

Last released

Data filters

papermage

Last released

Papermage. Casting magic over scientific PDFs.

mmdata

Last released

MMData is a toolkit for curating multimodal datasets.

tartare

Last released

Data filters

tokreate

Last released

Unified APIs for making calls to different LLMs.

quickumls

Last released

QuickUMLS is a tool for fast, unsupervised biomedical concept extraction from medical text

smashed

Last released

SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.

decontext

Last released

Pipeline for decontextualization of scientific snippets.

springs

Last released

A set of utilities to create and manage typed configuration files effectively, built on top of OmegaConf.

necessary

Last released

Python package to enforce optional dependencies

shadow-scholar

Last released

🎓🕶️ A collection of utilities and demos from the Semantic Scholar Research Team 🕶️🎓

mmda

Last released

MMDA - multimodal document analysis

trouting

Last released

Trouting (short for Type Routing) is a simple class decorator that allows to define multiple interfaces for a method that behave differently depending on input types.

pyterrier-sentence-transformers

Last released

Create an pyterrier index using any sentence-transformers model

scipdf

Last released

multimodal document analysis

espresso-config

Last released

A struct config parser that you can set up in the

Minimal-Server

Last released

Serve a python object through a simple socket; supports multiple connections.

quickumls-simstring

Last released

Clone of simstring designed to work with QuickUMLS. Original version here: http://chokkan.org/software/simstring/

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page