Skip to main content

Orange3 widget for fetching articles from Il Post

Project description

Orange3 Il Post Widget

An Orange3 add-on widget for fetching articles, podcasts, and newsletters from the Italian online newspaper Il Post. The widget queries the Il Post search API and outputs a Corpus ready for text mining workflows.

User Interface

Il Post Orange Widget GUI

Installation

pip install orange3-ilpost

The package requires Orange3 and orange3-text to be installed. The Il Post API wrapper (ilpost-api-wrapper) is installed automatically as a dependency.

Usage

After installation, the Il Post category will appear in the Orange Canvas widget toolbox.

  1. Drag the Il Post widget onto the canvas.
  2. Type a search query in the Query field and press Enter or click Search.
  3. Adjust the filters as needed (see below).
  4. Connect the widget output to any text mining widget (e.g. Corpus Viewer, Word Cloud, Topic Modelling).

Controls

Control Description
Query Search term. Keeps a history of recent queries.
Content type Filter by All, Articles, Podcasts, or Newsletters.
Date range Filter by All time, Past year, or Past 30 days.
Sort by Sort results by Relevance, Newest, or Oldest.
Category Optional editorial category filter (e.g. politica, cultura). Applies to articles only.
Max documents Maximum number of results to retrieve (1–1000, capped at 100 when Content is selected).
Include paywalled content When unchecked, subscriber-only results are excluded.
Text includes Choose which fields are included in the output corpus and used as text features for analysis. Use Select All to check or uncheck all fields at once. Selecting Content (max 100 articles) also downloads the full article body (capped at 100 documents).

Output

The widget outputs a Corpus with the following metadata columns:

Field Type Description
Title String Article/episode title
Summary String Short description
Content String Full article body (only when Content is selected in Text includes)
Highlight String Search snippet with matched terms
Category String Editorial category
Tags String Comma-separated topic tags
Type String Content type (post, episodes, newsletter)
Publication Date Time Publication timestamp (Italian local time)
URL String Link to the full content
Relevance Score Continuous Search relevance score (0.0 when sorted by date)

Only the columns selected in Text includes appear in the output corpus (Publication Date and Relevance Score are always included).

Example workflow

[Il Post] → [Corpus Viewer]
[Il Post] → [Word Cloud]
[Il Post] → [Topic Modelling]
[Il Post] → [Sentiment Analysis]

Requirements

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

orange3_ilpost-0.2.0.tar.gz (8.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

orange3_ilpost-0.2.0-py3-none-any.whl (9.1 kB view details)

Uploaded Python 3

File details

Details for the file orange3_ilpost-0.2.0.tar.gz.

File metadata

  • Download URL: orange3_ilpost-0.2.0.tar.gz
  • Upload date:
  • Size: 8.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for orange3_ilpost-0.2.0.tar.gz
Algorithm Hash digest
SHA256 642bd6a3b2acdc96f0d1a8dd30c503945c0f7e2832179007d614dde4d2e93157
MD5 247835581501c42a3b1e61de583dc303
BLAKE2b-256 a47393767363d26a57c6c33bd04a37377bccb6f1fb790f81eb5ce62f9fd38cb8

See more details on using hashes here.

File details

Details for the file orange3_ilpost-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: orange3_ilpost-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 9.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for orange3_ilpost-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ca36b4bb779a49f3b872dba0549cf2bb08dc3c36775168cb7a967a88486fa0a6
MD5 c65c887bfc273196da1c86f8b73a33d1
BLAKE2b-256 e1bdd403d57357d40046ccef3b508b3d84c9bbf5a24df031194a3e9aa87bb9ea

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page