Common lib used by Dipoankar
Project description
Example of code:
from pydipankar import Spider
res =[]
for i in range(1,4):
url = 'http://www.geeksforgeeks.org/category/matrix/page/%s/'%i
soup = Spider.buildSoup(url);
res += Spider.getAttrListForXPath(soup,'div#content article',None,{'entry-title':['.entry-title a','text'],'entry-summary':['.entry-summary','text']})
f = open('a.txt','w+')
ii =0
for r in res:
ii = ii+1
f.write(str(ii) +'. '+r.get('entry-summary')[0].encode('utf-8').strip()+'\n\n\n')
f.close()
from pydipankar import Spider
res =[]
for i in range(1,4):
url = 'http://www.geeksforgeeks.org/category/matrix/page/%s/'%i
soup = Spider.buildSoup(url);
res += Spider.getAttrListForXPath(soup,'div#content article',None,{'entry-title':['.entry-title a','text'],'entry-summary':['.entry-summary','text']})
f = open('a.txt','w+')
ii =0
for r in res:
ii = ii+1
f.write(str(ii) +'. '+r.get('entry-summary')[0].encode('utf-8').strip()+'\n\n\n')
f.close()
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pydipankar-1.0.tar.gz
(2.3 kB
view hashes)