Skip to main content

Orange3 widget for fetching articles from Il Post

Project description

Orange3 Il Post Widget

An Orange3 add-on widget for fetching articles, podcasts, and newsletters from the Italian online newspaper Il Post. The widget queries the Il Post search API and outputs a Corpus ready for text mining workflows.

User Interface

Il Post Orange Widget GUI

Installation

pip install orange3-ilpost

The package requires Orange3 and orange3-text to be installed. The Il Post API wrapper (ilpost-api-wrapper) is installed automatically as a dependency.

Usage

After installation, the Il Post category will appear in the Orange Canvas widget toolbox.

  1. Drag the Il Post widget onto the canvas.
  2. Type a search query in the Query field and press Enter or click Search.
  3. Adjust the filters as needed (see below).
  4. Connect the widget output to any text mining widget (e.g. Corpus Viewer, Word Cloud, Topic Modelling).

Controls

Control Description
Query Search term. Keeps a history of recent queries.
Content type Filter by All, Articles, Podcasts, or Newsletters.
Date range Filter by All time, Past year, or Past 30 days.
Sort by Sort results by Relevance, Newest, or Oldest.
Category Optional editorial category filter (e.g. politica, cultura). Applies to articles only.
Max documents Maximum number of results to retrieve (1–1000, capped at 100 when Content is selected).
Include paywalled content When unchecked, subscriber-only results are excluded.
Text includes Choose which fields are included in the output corpus and used as text features for analysis. Use Select All to check or uncheck all fields at once. Selecting Content (max 100 articles) also downloads the full article body (capped at 100 documents).

Output

The widget outputs a Corpus with the following metadata columns:

Field Type Description
Title String Article/episode title
Summary String Short description
Content String Full article body (only when Content is selected in Text includes)
Highlight String Search snippet with matched terms
Category String Editorial category
Tags String Comma-separated topic tags
Type String Content type (post, episodes, newsletter)
Publication Date Time Publication timestamp (Italian local time)
URL String Link to the full content
Relevance Score Continuous Search relevance score (0.0 when sorted by date)

Only the columns selected in Text includes appear in the output corpus (Publication Date and Relevance Score are always included).

Example workflow

[Il Post] → [Corpus Viewer]
[Il Post] → [Word Cloud]
[Il Post] → [Topic Modelling]
[Il Post] → [Sentiment Analysis]

Requirements

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

orange3_ilpost-0.2.1.tar.gz (8.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

orange3_ilpost-0.2.1-py3-none-any.whl (9.2 kB view details)

Uploaded Python 3

File details

Details for the file orange3_ilpost-0.2.1.tar.gz.

File metadata

  • Download URL: orange3_ilpost-0.2.1.tar.gz
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for orange3_ilpost-0.2.1.tar.gz
Algorithm Hash digest
SHA256 1611851ba08dfd154781b73af26ce15943cc612d09cb065eef3e6b34dc462a56
MD5 6db81fcb5899d11752f7034b26dea311
BLAKE2b-256 eb101ca3ca6d7d95ad7bc998c6c6765a3a1116f9529ed3105549222c5c8e85b3

See more details on using hashes here.

File details

Details for the file orange3_ilpost-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: orange3_ilpost-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for orange3_ilpost-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 419226982d40d427000c32376dca2aebcba0de6c0727d25f23135cf57821df48
MD5 18c759e1f67046cd0e3e10a6f783acdf
BLAKE2b-256 fdf267de99e3698f0e223ae8776602c6b409437b3b6adcc8e32ffa4a6982270d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page