Orange3 widget for fetching articles from Il Post

Orange3 Il Post Widget

An Orange3 add-on widget for fetching articles, podcasts, and newsletters from the Italian online newspaper Il Post. The widget queries the Il Post search API and outputs a Corpus ready for text mining workflows.

User Interface

[Screenshot: Il Post Orange Widget GUI]

Installation

pip install orange3-ilpost

The package requires Orange3 and orange3-text to be installed. The Il Post API wrapper (ilpost-api-wrapper) is installed automatically as a dependency.
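To confirm that the dependencies are present before launching Orange, a small standard-library check along these lines may help. It is a sketch only: the distribution names are the ones mentioned in this README, and `installed_versions` is a hypothetical helper, not part of the add-on.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names as given in the installation notes of this README.
REQUIRED = ("Orange3", "orange3-text", "ilpost-api-wrapper", "orange3-ilpost")


def installed_versions(names=REQUIRED):
    """Map each distribution name to its installed version, or None if missing."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions().items():
        print(f"{name}: {ver or 'not installed'}")
```

Any entry reported as not installed points to the package that still needs `pip install`.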

Usage

After installation, the Il Post category will appear in the Orange Canvas widget toolbox.

  1. Drag the Il Post widget onto the canvas.
  2. Type a search query in the Query field and press Enter or click Search.
  3. Adjust the filters as needed (see below).
  4. Connect the widget output to any text mining widget (e.g. Corpus Viewer, Word Cloud, Topic Modelling).

Controls

| Control | Description |
|---|---|
| Query | Search term. Keeps a history of recent queries. |
| Content type | Filter by All, Articles, Podcasts, or Newsletters. |
| Date range | Filter by All time, Past year, or Past 30 days. |
| Sort by | Sort results by Relevance, Newest, or Oldest. |
| Category | Optional editorial category filter (e.g. politica, cultura). Applies to articles only. |
| Max documents | Maximum number of results to retrieve (10–1000). |
| Include paywalled content | When unchecked, subscriber-only results are excluded. |
| Text includes | Choose which fields are used as text features for analysis: Title, Summary, Highlight, Category, Tags. |
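The constraints listed for the controls can be summarised in code. The following is a minimal sketch, not the widget's actual implementation: the option labels and the 10–1000 range come from the descriptions above, while the `SearchSettings` class, its attribute names, and its default values are assumptions made for illustration.

```python
from dataclasses import dataclass

# Option labels copied from the control descriptions; all names are hypothetical.
CONTENT_TYPES = ("All", "Articles", "Podcasts", "Newsletters")
DATE_RANGES = ("All time", "Past year", "Past 30 days")
SORT_ORDERS = ("Relevance", "Newest", "Oldest")
TEXT_FIELDS = ("Title", "Summary", "Highlight", "Category", "Tags")


@dataclass
class SearchSettings:
    query: str
    content_type: str = "All"
    date_range: str = "All time"
    sort_by: str = "Relevance"
    category: str = ""                # optional editorial category
    max_documents: int = 100          # assumed default; documented range is 10-1000
    include_paywalled: bool = True    # assumed default
    text_includes: tuple = ("Title", "Summary")  # assumed default

    def problems(self):
        """Return a list of problems; an empty list means the settings look valid."""
        issues = []
        if self.content_type not in CONTENT_TYPES:
            issues.append(f"Unknown content type: {self.content_type!r}")
        if self.date_range not in DATE_RANGES:
            issues.append(f"Unknown date range: {self.date_range!r}")
        if self.sort_by not in SORT_ORDERS:
            issues.append(f"Unknown sort order: {self.sort_by!r}")
        if not 10 <= self.max_documents <= 1000:
            issues.append("Max documents must be between 10 and 1000.")
        if self.category and self.content_type in ("Podcasts", "Newsletters"):
            issues.append("Category applies to articles only.")
        unknown = [f for f in self.text_includes if f not in TEXT_FIELDS]
        if unknown:
            issues.append(f"Unknown text fields: {unknown}")
        return issues


if __name__ == "__main__":
    settings = SearchSettings(query="elezioni", content_type="Podcasts",
                              category="politica", max_documents=5)
    for issue in settings.problems():
        print(issue)
```

Collecting the problems in a list rather than raising on the first one makes it possible to report every conflicting setting at once.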

Output

The widget outputs a Corpus with the following metadata columns:

| Field | Type | Description |
|---|---|---|
| Title | String | Article/episode title |
| Summary | String | Short description |
| Highlight | String | Search snippet with matched terms |
| Category | String | Editorial category |
| Tags | String | Comma-separated topic tags |
| Type | String | Content type (post, episodes, newsletter) |
| Publication Date | Time | Publication timestamp (Italian local time) |
| URL | String | Link to the full content |
| Relevance Score | Continuous | Search relevance score (0.0 when sorted by date) |
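As a rough mental model of the output, each document can be pictured as one record carrying the columns above, with the analysis text assembled from whichever fields are ticked under Text includes. The sketch below uses plain Python rather than the Orange Corpus API; the `Record` class, the helper functions, and the sample values are all invented for illustration.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Record:
    """One output row; attribute names mirror the metadata columns (hypothetical)."""
    title: str
    summary: str
    highlight: str
    category: str
    tags: str                    # comma-separated topic tags
    type: str                    # post, episodes, or newsletter
    publication_date: datetime   # Italian local time
    url: str
    relevance_score: float       # 0.0 when results are sorted by date


def split_tags(record):
    """Turn the comma-separated Tags string into a clean list."""
    return [tag.strip() for tag in record.tags.split(",") if tag.strip()]


def document_text(record, fields=("title", "summary")):
    """Join the selected fields into one text, skipping empty ones."""
    parts = [getattr(record, name) for name in fields]
    return "\n".join(part for part in parts if part)


# Made-up sample values; they do not come from the Il Post API.
sample = Record(
    title="Example title",
    summary="Example summary",
    highlight="",
    category="politica",
    tags="governo, elezioni",
    type="post",
    publication_date=datetime(2024, 1, 15, 9, 30),
    url="https://example.invalid/article",
    relevance_score=0.0,
)

if __name__ == "__main__":
    print(split_tags(sample))
    print(document_text(sample, fields=("title", "summary", "highlight")))
```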

Example workflow

[Il Post] → [Corpus Viewer]
[Il Post] → [Word Cloud]
[Il Post] → [Topic Modelling]
[Il Post] → [Sentiment Analysis]

Requirements

Orange3 and orange3-text; ilpost-api-wrapper is installed automatically as a dependency.

License

MIT
