FurAffinity API

Python module to implement API-like functionality for the FurAffinity.net website.

Requirements

Python 3.8+ is necessary to run this module.

This module requires the following pypi modules:

  • cfscrape
  • beautifulsoup4
  • requests

Usage

The API consists of a main class FAAPI and two submission classes: Sub and SubPartial. Once FAAPI is initialized, its methods can be used to crawl FA and return machine-readable objects.

import faapi
import json

api = faapi.FAAPI()

# download the submission metadata and its file
sub, sub_file = api.get_sub(12345678, get_file=True)

print(sub.id, sub.title, sub.author, f"{len(sub_file)/1024:.2f}KiB")

# Sub objects can be cast to dict for JSON serialisation
with open(f"{sub.id}.json", "w") as f:
    f.write(json.dumps(dict(sub)))

# save the submission file under its original name
with open(sub.file_url.split("/")[-1], "wb") as f:
    f.write(sub_file)

# crawl the first page of a user's gallery
gallery, _ = api.gallery("user_name", 1)
with open("user_name-gallery.json", "w") as f:
    f.write(json.dumps([dict(s) for s in gallery]))

robots.txt

At init, the FAAPI object downloads the robots.txt file from FA to determine the Crawl-delay value set therein. If not set, a value of 1 second is used.

To respect this value, the default behaviour of the FAAPI object is to wait before performing a get request if the last request was made more recently than the crawl delay allows.

See under FAAPI for more details on this behaviour.

Furthermore, any get operation that points to a disallowed path from robots.txt will raise an exception. This check should not be circumvented and the developer of this module does not take responsibility for violations of the TOS of FurAffinity.
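A minimal sketch of this behaviour, using the crawl_delay and raise_for_delay fields described under FAAPI below (the exact exception type raised is not documented here, so a generic except is used):

import faapi

api = faapi.FAAPI()
print(api.crawl_delay)  # crawl delay parsed from robots.txt, defaults to 1

# consecutive get calls are normally spaced by the crawl delay;
# with raise_for_delay set, a too-early call raises instead of waiting
api.raise_for_delay = True
try:
    api.get("/")
    api.get("/")  # performed immediately after the first one
except Exception as err:
    print("request made before the crawl delay elapsed:", err)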

Cookies

To access protected pages, cookies from an active session are needed. These cookies must be given to the FAAPI object as a list of dictionaries, each containing a name and a value field. The cookies list should look like the following random example:

cookies = [
    {"name": "a", "value": "38565475-3421-3f21-7f63-3d341339737"},
    {"name": "b", "value": "356f5962-5a60-0922-1c11-65003b703038"},
]

Only cookies a and b are needed.

To access session cookies, consult the manual of the browser used to login.

Note: it is important not to log out of the session the cookies belong to, otherwise they will stop working.
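For example, creating an API object with the cookies above (the values are random placeholders):

import faapi

cookies = [
    {"name": "a", "value": "38565475-3421-3f21-7f63-3d341339737"},
    {"name": "b", "value": "356f5962-5a60-0922-1c11-65003b703038"},
]

api = faapi.FAAPI(cookies=cookies)

# cookies can also be swapped on an existing object, which
# remakes the underlying session (see load_cookies under FAAPI)
api.load_cookies(cookies)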

FAAPI

This is the main object that handles all the calls to scrape pages and get submissions.

It holds 6 different fields:

  • cookies: List[dict] = [] cookies passed at init
  • session: CloudflareScraper cfscrape session used for get requests
  • robots: Dict[str, List[str]] robots.txt values
  • crawl_delay: float crawl delay from robots.txt, else 1
  • last_get: float time of last get (not UNIX time, uses time.perf_counter for more precision)
  • raise_for_delay: bool = False if set to True, raises an exception if a get call is made before enough time has passed

Init

__init__(cookies: List[dict] = None)

The class init has a single optional argument cookies necessary to read logged-in-only pages. The cookies can be omitted and the API will still be able to access public pages.

Note: Cookies must be in the format mentioned above in #Cookies.

Methods

  • load_cookies(cookies: List[Dict[str, Any]])
    Load new cookies in the object and remake the CloudflareScraper session.

  • get(path: str, **params) -> requests.Response
    Returns a response object containing the result of the get operation on the given url with the optional **params added to it (the provided path is treated as relative to 'https://www.furaffinity.net/').

  • get_parse(path: str, **params) -> bs4.BeautifulSoup
    Similar to get() but returns the parsed HTML from the normal get operation.

  • get_sub(sub_id: int, get_file: bool = False) -> Tuple[Sub, Optional[bytes]]
    Given a submission ID in either int or str format, it returns a Sub object containing the various metadata of the submission itself and a bytes object with the submission file if get_file is passed as True.

  • get_sub_file(sub: Sub) -> Optional[bytes]
    Given a submission object, it downloads its file and returns it as a bytes object.

  • userpage(user: str) -> Tuple[str, str, bs4.BeautifulSoup]
    Returns the user's full display name (i.e. with capital letters and extra characters such as "_"), the user's status (the first character found beside the user name), and the parsed profile text in HTML.

  • gallery(user: str, page: int = 1) -> Tuple[List[SubPartial], int]
    Returns the list of submissions found on a specific gallery page and the number of the next page. The returned page number is set to 0 if it is the last page (see the pagination sketch after this list).

  • scraps(user: str, page: int = 1) -> Tuple[List[SubPartial], int]
    Same as gallery(), but scrapes a user's scraps page instead.

  • favorites(user: str, page: str = '') -> Tuple[List[SubPartial], str]
    Like gallery() and scraps(), it downloads a user's favorites page. Because of how favorites pages work on FA, the page argument (and the one returned) are strings. If the favorites page is the last one, an empty string is returned as the next page. An empty page value as argument is equivalent to page 1.
    Note: favorites page "numbers" do not follow any scheme and are only generated server-side.

  • search(q: str = '', page: int = 0, **params) -> Tuple[List[SubPartial], int, int, int, int]
    Parses FA search given the query (and optional other params) and returns the submissions found and the next page together with basic search statistics: the number of the first submission in the page, the number of the last submission in the page (0-indexed), and the total number of submissions found in the search. For example, if the last three returned integers are 1, 47 and 437, then the page contains submissions 1 through 48 of a search that found a total of 437 submissions.
    Note: as of 2020-08-01 the "/search" path is disallowed by FA's robots.txt.
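A short pagination sketch built on gallery() ("user_name" is a placeholder; favorites() works the same way, but with string page tokens):

import faapi

api = faapi.FAAPI()

# collect every submission in a user's gallery by following
# the next-page number until it reaches 0 (the last page)
results = []
page = 1
while page:
    subs, page = api.gallery("user_name", page)
    results.extend(subs)

print(f"{len(results)} submissions found")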

SubPartial

This lightweight submission object is used to contain the information gathered when parsing gallery, scraps, favorites and search pages. It contains only the following fields:

  • id submission id
  • title submission title
  • author submission author
  • rating submission rating [general, mature, adult]
  • type submission type [text, image, etc...]

SubPartial can be directly cast to a dict object or iterated through.

Init

__init__(sub_figure: bs4.element.Tag)

SubPartial init needs a figure tag taken from a parsed page. This tag is not saved in the submission object.

Methods

  • parse_figure_tag(figure_tag: bs4.element.Tag)
    Takes a figure tag from a parsed page and parses it for information.

Sub

The main class that parses and holds submission metadata.

  • id submission id
  • title submission title
  • author submission author
  • date upload date in YYYY-MM-DD format
  • tags tags list
  • category category *
  • species species *
  • gender gender *
  • rating rating *
  • description the description as an HTML formatted string
  • file_url the url to the submission file

* these are extracted exactly as they appear on the submission page

Sub can be directly cast to a dict object or iterated through.
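For instance, a sketch of both behaviours (assuming sub is a parsed Sub object, as in the usage example above; iteration yielding field/value pairs is an assumption consistent with the dict cast):

sub_dict = dict(sub)  # cast to a plain dict of field/value pairs

for field, value in sub:  # iterate over the same pairs
    print(field, value)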

Init

__init__(sub_page: bs4.BeautifulSoup = None)

The init takes an optional bs4.BeautifulSoup object containing the parsed HTML of a submission page.

The sub_page argument is saved as an instance variable and is then parsed to obtain the other fields.

If no sub_page is passed then the object fields will remain at their default - empty - value.
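A minimal sketch of manual initialisation (assuming submission pages live under FA's usual /view/<id>/ path and that Sub is importable from the package; get_sub() returns the same metadata directly):

import faapi

api = faapi.FAAPI()

# fetch and parse a submission page, then build a Sub from it
page = api.get_parse("/view/12345678/")  # the /view/<id>/ path is an assumption
sub = faapi.Sub(page)
print(sub.id, sub.title)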

Methods

  • parse_page()
    Parses the stored submission page for metadata.

Examples

To see examples of the various functionalities of the module, use the scripts located in the examples folder. These can be run directly, provided the requirements are met.

See the specific scripts for more details.
