Skip to main content

Empowering Effortless Data Linking.

Project description



linkifile: Empowering Effortless Data Linking

Package PyPI Latest Release PyPI Downloads
Meta Powered by linkifile License - MIT

What is linkifile?

linkifile is a Python package designed to automate the process of populating one column of data by web scraping information from the internet based on the contents of another column. It simplifies the task of linking data columns and enriching your datasets with desired links.

Table of Contents

Main Features

  • Effortlessly link data columns, saving time and effort in manual data enrichment tasks.
  • Utilize web scraping to retrieve relevant information from the web and populate your data columns with valuable insights.
  • Tailor your queries to extract specific data from the web, customizing the data enrichment process.
  • Accelerate the data linking process with built-in multithreading support for faster execution, even with large datasets.
  • Designed with user-friendliness in mind, making it accessible to users of all levels of technical expertise.

Where to get it

The source code is currently hosted on GitHub at: https://github.com/ar8372/linkifile

Binary installers for the latest released version are available at the Python Package Index (PyPI).

Installation

You can install linkifile using pip:

pip install linkifile

Usage

  1. Import the Linker module from the linkifile package.
  2. Create an instance of the Linker class by specifying the source file, column pairs, and optional destination file.
  3. Use the populate method to link data columns based on web scraping queries.

Example:

from linkifile import Linker

# Create a Linker instance with source file, column pairs, and optional destination file
l = Linker(source_file="data.csv", coln_pairs=["Company Name", "Website Link"])

# Populate data columns based on a specific query
l.populate(query="{{x}} official website")
Before After

License

This project is licensed under the MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

linkifile-0.1.4.tar.gz (7.8 kB view details)

Uploaded Source

File details

Details for the file linkifile-0.1.4.tar.gz.

File metadata

  • Download URL: linkifile-0.1.4.tar.gz
  • Upload date:
  • Size: 7.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.5

File hashes

Hashes for linkifile-0.1.4.tar.gz
Algorithm Hash digest
SHA256 e4d3d2067d85e12c483f0f8964052d994a5af44dcd432cfcb44d8a6044b2ff99
MD5 45d560de431b210b50d968ab3803a8e2
BLAKE2b-256 a4e71a7b344d2eaf8db3d74fa5791065d411736479d93a96b0f2579fda7d8651

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page