Skip to main content

Parser for easy data retrieval from cian.ru

Project description

Cian Parser

What is this?

This is a parser with which you can easily get data from a website cian.ru.

Quick Guide

This module is based on Selenium-Stealth, using BeautifulSoup as well as Asyncio

Data you can get:

  1. Name of apartment
  2. The city district in which the housing estate is located
  3. Price of the apartment
  4. Time to the subway
  5. How to get to the subway
  6. Nearest subway
  7. Price per square meter
  8. Total square footage
  9. Living Space
  10. Floor
  11. Number of stories in the house
  12. Year of delivery of the house
  13. Surrendered or not
  14. Finishing
  15. Parking
  16. Ceiling Heights
  17. Builder Rating

Using

Using the library is as simple and convenient as possible:

Let's import it first: First, import everything from the library (use the from ... import * construct).

Examples of all operations:

Сreate an instance of a class Cian_Parser (PATH - file save path, URL - site url, BOOST (True or False) - you can also receive a seller rating, but the speed is reduced several times, COUNT_PAGE - how many pages of apartments do you want to receive):

parser = Cian_Parser(PATH, URL, BOOST, COUNT_PAGE)

Receive all data of all apartments in CSV format using the start_parsing() function:

parser.start_parsing()

If you want to create your own parser logic, then use the description of the other modules:

Сreate an instance of a class Pagination (parser - parser instance from the Flats_Url class, next_button_selector - XPATH pagination buttons)

pagination = Pagination(parser, next_button_selector)

Checking for next page using the HasNextPage() function:

await pagination.HasNextPage()

Go to next page using the GoToTheNextPage() function:

await pagination.GoToTheNextPage()

Developer

My site: link

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parser_cian-0.1.6.tar.gz (7.1 kB view details)

Uploaded Source

File details

Details for the file parser_cian-0.1.6.tar.gz.

File metadata

  • Download URL: parser_cian-0.1.6.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for parser_cian-0.1.6.tar.gz
Algorithm Hash digest
SHA256 ef8ef6d2f4eb8e413fe5e5dfc475d370fd5870782e86f4a14fa22a07e6148884
MD5 ba20e17058ef47339002e18e9ac63e08
BLAKE2b-256 e32ed52a0e4b5283b60a982def88cc0dc472bda7a1a08531b04529d49a6a1951

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page