Skip to main content

Double-metaphone-like algorithm for the french language

Project description

What’s that?

dprj is an algorithm that extracts sounds from words. It’s similar to metaphone, except it’s targeted at the french language only. While double metaphone or similar can be ok to regroup content that sound more or less the same, it becomes unusable when you want to get the real pronunciation of a word, since it’s mainly targeted at english. And we all know french can be really tricky sometimes.

How to use it?

Install it from PyPI:

pip install dprj

Then just use the say function:

>> from dprj import say
>> say(u'chaussette')
u'xst'
>> say(u'psychologue')
u'psklg'
>> say(u'agglomération')
u'glmrsn'
>> say(u'accès')
u'ks'

How reliable is it?

Well, try it and see for yourself! There are still some cases to handle:

  • some plurals
  • some silent final letters (eg. ‘r’)
  • french weirdnesses

If you get incorrect results for any word please let me know by filing and issue.

Project details


Release history Release notifications

This version
History Node

0.1.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
dprj-0.1.0.tar.gz (3.6 kB) Copy SHA256 hash SHA256 Source None Nov 4, 2014

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging CloudAMQP CloudAMQP RabbitMQ AWS AWS Cloud computing Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page