Skip to main content
This is a pre-production deployment of Warehouse. Changes made here affect the production instance of PyPI (pypi.python.org).
Help us improve Python packaging - Donate today!

Generate CSV files containing random words from a dictionary

Project Description

Generate CSV files containing random words from a dictionary

Get the script from the buildout:

>>> import os
>>> gibberish = os.path.join(
...     reduce(lambda path, _: os.path.dirname(path),
...            range(3), __file__), 'bin', 'gibberish')

Print the help:

>>> from zc.buildout.testing import system
>>> print system(gibberish+' --help'),
usage: gibberish [options] LINES COLUMN [COLUMN ...]
<BLANKLINE>
Generate lines of CSV consisting of random words from a
dictionary.  The number of lines of CSV must be specified either
as a single integer to specify a fixed number of lines or two
integers separated by a dash to specify that a random number of
lines between the two integers should be used.  The columns are
specified in the same manner where the numbers represent the
number of words in that column for a given line.
<BLANKLINE>
options:
  -h, --help            show this help message and exit
  -w WORDS, --words=WORDS
                        File containing the words to be chosen
                        from [default: /usr/share/dict/words]

Make a simple file with one line and one column containing one word:

>>> import cStringIO, csv
>>> result = tuple(csv.reader(cStringIO.StringIO(
...     system(gibberish+' 1 1'))))
>>> len(result)
1
>>> len(result[0])
1
>>> len(result[0][0].split())
1

Make sure that the newline is stripped:

>>> result[0][0][-1] != '\n'
True

With two words in the column:

>>> result = tuple(csv.reader(cStringIO.StringIO(
...     system(gibberish+' 1 2'))))
>>> len(result)
1
>>> len(result[0])
1
>>> len(result[0][0].split())
2

With a random number of words in the column:

>>> result = tuple(csv.reader(cStringIO.StringIO(
...     system(gibberish+' 1 1-10'))))
>>> len(result)
1
>>> len(result[0])
1
>>> 1 <= len(result[0][0].split()) <= 10
True

With 10 lines:

>>> result = tuple(csv.reader(cStringIO.StringIO(
...     system(gibberish+' 10 2'))))
>>> len(result)
10
>>> len(result[0])
1
>>> len(result[0][0].split())
2

With a random number of lines:

>>> result = tuple(csv.reader(cStringIO.StringIO(
...     system(gibberish+' 1-10 2'))))
>>> 1 <= len(result) <= 10
True
>>> len(result[0])
1
>>> len(result[0][0].split())
2

With two columns:

>>> result = tuple(csv.reader(cStringIO.StringIO(
...     system(gibberish+' 1 2 3'))))
>>> len(result)
1
>>> len(result[0])
2
>>> len(result[0][0].split())
2
>>> len(result[0][1].split())
3

With a random number of words including zero in the column:

>>> result = tuple(csv.reader(cStringIO.StringIO(
...     system(gibberish+' 1 0-1'))))
>>> len(result)
1
>>> len(result[0])
1
>>> len(result[0][0].split()) in (0, 1)
True

With a small dictionary to test exhausting the dictionary:

>>> import tempfile
>>> _, tmp_path = tempfile.mkstemp()
>>> tmp = file(tmp_path, 'w')
>>> tmp.write('foo')
>>> tmp.close()
>>> result = tuple(csv.reader(cStringIO.StringIO(
...     system(gibberish+' -w %s 1 1' % tmp_path))))
>>> result
(['foo'],)
>>> os.remove(tmp_path)
Release History

Release History

This version
History Node

0.1

Download Files

Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File Name & Checksum SHA256 Checksum Help Version File Type Upload Date
z3c.gibberish-0.1.tar.gz (4.9 kB) Copy SHA256 Checksum SHA256 Source Dec 18, 2007

Supported By

WebFaction WebFaction Technical Writing Elastic Elastic Search Pingdom Pingdom Monitoring Dyn Dyn DNS Sentry Sentry Error Logging CloudAMQP CloudAMQP RabbitMQ Heroku Heroku PaaS Kabu Creative Kabu Creative UX & Design Fastly Fastly CDN DigiCert DigiCert EV Certificate Rackspace Rackspace Cloud Servers DreamHost DreamHost Log Hosting