Skip to main content

Downloads and formats search results from dblp

Project description

# dblp-pub
A scraper that downloads search results from DBLP. You can search for any list of keywords.

Scraping dblp is allowed by them and they even offer an API for it. The API, however, is restricted to the search for authors.
This is why I wrote this scraper, which allows you to also search for publications.

## Installation

Just type "pip install dblp-pub" in your terminal and it will get installed. You can also clone this repository and type "python setup.py install" after switching to the downloaded folder.

## How to use it?

After importing it, you can use the search()-function to search in dblp for any array of key words. The result will be a pandas DataFrame.

## Example

Imagine you are interested research in Collaborative Writing and want to evaluate which conferences people who write about it publish to.
With this library you get a table for every search request, so you can easily compare results. Below is an example for one of these tables
and the lookup for the search phrase "Collaborative Writing".

```python
import dblp-pub as dblp

results = dblp.search(["Collaborative Writing"])

results.head()
```
The Code above yields the following results:


|Authors | Link | Title |Type | Where |Year |
|------------------------|:--------------------:|:--------------------------------:|:-----------:|:--------:|----:|
| [Lekha Limbu, Lina M...| http://dx.doi.org/...|How do learners experience join...|article |Compute...|2015 |
| [Sebastian Gehrmann,...| http://dx.acm.org/...|Deploying AI Methods to Support...|inproceedings|CHI Ext...|2015 |
| [Dakuo Wang, Judith ...| http://dx.acm.org/...|DocuViz: Visualizing Collaborat...|inproceedings|CHI |2015 |
| [Oluwabunmi Adewoyin...| http://dx.doi.org/...|Exploiting the Use of Wikis to ...|inproceedings|CRIWG |2015 |
| [Menghui Li, Young M...| http://dx.doi.org/...|An Exploration of Mobile Collab...|inproceedings|HCI (24...|2015 |

Now imagine, you want to look for the papers with "AI" in the title. You can easily do that using pandas:

```python
results[results['Title'].str.contains("AI")]
```

Given the search query above, this yield one paper with the following link: http://doi.acm.org/10.1145/2702613.2732705

## Licence

This code is published under the MIT licence. Feel free to improve and redistribute my code. I hope I made your life a little bit easier.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dblp-0.1.0.tar.gz (3.3 kB view details)

Uploaded Source

Built Distribution

dblp-0.1.0-py2-none-any.whl (5.4 kB view details)

Uploaded Python 2

File details

Details for the file dblp-0.1.0.tar.gz.

File metadata

  • Download URL: dblp-0.1.0.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dblp-0.1.0.tar.gz
Algorithm Hash digest
SHA256 ff66a040a15294f29ac3533e77ee242e3880bfe0c3d1a9fee2a772cc7a37e009
MD5 f2f86464f0cbe5ea2cf0175a079b9416
BLAKE2b-256 08dc6c3da684cd05008d13159ed792120dda1d373ee6cc15344cc82b1e1f6d0f

See more details on using hashes here.

File details

Details for the file dblp-0.1.0-py2-none-any.whl.

File metadata

  • Download URL: dblp-0.1.0-py2-none-any.whl
  • Upload date:
  • Size: 5.4 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dblp-0.1.0-py2-none-any.whl
Algorithm Hash digest
SHA256 1dd848a1c5b1161f23c6f263109f4ed3ccd3c24c3dad7143dcc1f7537a279127
MD5 9e2aa4f78671652395e0c5cec57c1722
BLAKE2b-256 ab50f862b0c09ede5ba8f45b43a5c173f33c3d9c2eba42c6da29fb7d3acf6a85

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page