A sentiment analysis server implementation. Designed to be extendable, so new algorithms and sources can be used.

![GSI Logo](http://gsi.dit.upm.es/templates/jgsi/images/logo.png)
[DoCon](http://demos.gsi.dit.upm.es/docon)
==================================

Introduction
---------------------
This tool takes several input formats and translates them into semantic formats. It focuses on translating corpora to the NIF+[Marl](http://gsi.dit.upm.es/ontologies/marl) format, using JSON-LD.

DoCon is under heavy development. As of this writing, it supports:

* Creating and administering translation templates (admin level)
* Editing templates to convert traditional formats (csv, tsv, xls, xml) to NIF+Marl+Onyx
* Using the available templates to translate known formats through this portal or via POST requests
* Saving or outputting the result
* HTTP API
* Logging translation requests
* Auto selection of the best template based on the input format

In the future, we might include the following features:
* Conversion of semantic formats
* Automatic translation between semantic formats (e.g. [RDF](http://www.w3.org/RDF/) to [JSON-LD](http://json-ld.org/))

Translating a document
----------------------
Documents can be translated through the web interface, through the REST interface, or from the command line.

The Generator endpoint takes the following parameters:

* input (i): The original file to be translated
* informat (f): The format of the original file
* intype (t) [Optional]:
    * direct (default)
    * url
    * file
* outformat (o):
    * json-ld
    * rdfxml
    * turtle (default, to comply with NIF)
    * ntriples
    * trix
* base URI (u) [Optional]: base URI to use for the corpus
* prefix (p) [Optional]: prefix to replace the base URI
* language (l) [Optional]: language code (see dc:terms and [ISO 639](http://en.wikipedia.org/wiki/List_of_ISO_639-1_codes))
* template (t) [Optional]: ID of the template to use. If it is omitted, a template that converts from informat to outformat is used. Failing that, a template from informat to another format (e.g. json-ld) is used, followed by automatic conversion (*to be done*).
* toFile [Optional]: Whether the result should be sent in the response (default) or written to a file. For convenience, this value defaults to False when using the Web Form.
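
As a rough illustration of the list above, the following sketch assembles the form fields for a request and applies the documented defaults (`direct` for intype, `turtle` for outformat). The function name `build_params` is hypothetical and not part of DoCon. The base URI parameter is left out because its form field name is not given here, and the string used for `toFile` is an assumption.

```python
# Allowed values, copied from the parameter list above.
INTYPES = ("direct", "url", "file")
OUTFORMATS = ("json-ld", "rdfxml", "turtle", "ntriples", "trix")


def build_params(input_value, informat, intype="direct", outformat="turtle",
                 template=None, language=None, prefix=None, to_file=False):
    """Return a dict of form fields for the Generator endpoint.

    Hypothetical helper: it only mirrors the documented parameters and
    does not contact the server.
    """
    # The curl example in this README sends intype in upper case (FILE),
    # so the check is case-insensitive and the value is passed unchanged.
    if intype.lower() not in INTYPES:
        raise ValueError(f"intype must be one of {INTYPES}, got {intype!r}")
    if outformat not in OUTFORMATS:
        raise ValueError(f"outformat must be one of {OUTFORMATS}, got {outformat!r}")

    params = {
        "input": input_value,
        "informat": informat,
        "intype": intype,
        "outformat": outformat,
    }
    # Optional fields are only sent when a value was given.
    optional = {"template": template, "language": language, "prefix": prefix}
    params.update({key: value for key, value in optional.items() if value is not None})
    if to_file:
        # Assumption: the exact value the server expects is not documented.
        params["toFile"] = "true"
    return params
```

Calling `build_params("id,text\n1,good", "csv")` yields a dict with `intype` set to `direct` and `outformat` set to `turtle`, and no `template` key, which leaves template selection to the server.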

Using the command line tool *curl*, a request can be made like this:

    curl -F"template=Example_to_Marl" -F"input=@input-file.csv" -F"intype=FILE" \
        http://demos.gsi.dit.upm.es/docon/process \
        > result.jsonld
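
The same request could be prepared from Python. The sketch below uses only the standard library and builds a `multipart/form-data` POST equivalent to the curl call, without sending it. The helper names `build_multipart` and `build_translation_request` are hypothetical. The field names and the endpoint URL are taken from the curl example.

```python
import urllib.request
import uuid

# Endpoint taken from the curl example.
DOCON_URL = "http://demos.gsi.dit.upm.es/docon/process"


def build_multipart(fields, files):
    """Encode text fields and file uploads as multipart/form-data.

    fields: dict mapping field name to a string value
    files:  dict mapping field name to a (filename, bytes) tuple
    Returns (body_bytes, content_type_header_value).
    """
    boundary = uuid.uuid4().hex
    delimiter = b"--" + boundary.encode("ascii")
    lines = []
    for name, value in fields.items():
        lines += [
            delimiter,
            f'Content-Disposition: form-data; name="{name}"'.encode("utf-8"),
            b"",
            value.encode("utf-8"),
        ]
    for name, (filename, content) in files.items():
        lines += [
            delimiter,
            f'Content-Disposition: form-data; name="{name}"; '
            f'filename="{filename}"'.encode("utf-8"),
            b"Content-Type: application/octet-stream",
            b"",
            content,
        ]
    # Closing delimiter, followed by a final CRLF.
    lines += [delimiter + b"--", b""]
    return b"\r\n".join(lines), f"multipart/form-data; boundary={boundary}"


def build_translation_request(file_bytes, filename="input-file.csv",
                              template="Example_to_Marl", url=DOCON_URL):
    """Build (but do not send) the POST request from the curl example."""
    body, content_type = build_multipart(
        fields={"template": template, "intype": "FILE"},
        files={"input": (filename, file_bytes)},
    )
    return urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
```

Passing the returned request to `urllib.request.urlopen` and writing the response body to `result.jsonld` would mirror the shell redirection, assuming the demo server is reachable.
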

Command-line tool
-----------------
In addition to the methods above, this tool can be used directly from the command line.
Just install the package and run:

    docon -i <file to be converted> --template <conversion template> -o <output>

If you don't want to install the package, or prefer to run it as a module, you can also run it like so:

    python -m docon.cli -i <file to be converted> --template <conversion template> -o <output>
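
For batch conversions, either invocation can be wrapped from Python with `subprocess`. This is a minimal sketch: the helper names `docon_command` and `run_docon` are hypothetical, and only the `-i`, `--template` and `-o` options shown above are used.

```python
import subprocess
import sys


def docon_command(input_path, template, output_path, as_module=False):
    """Build the argument list for the docon command-line tool."""
    # Either the installed `docon` script or `python -m docon.cli`.
    launcher = [sys.executable, "-m", "docon.cli"] if as_module else ["docon"]
    return launcher + ["-i", input_path, "--template", template, "-o", output_path]


def run_docon(input_path, template, output_path, as_module=False):
    """Run the conversion; raises CalledProcessError on a non-zero exit."""
    cmd = docon_command(input_path, template, output_path, as_module)
    return subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.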

Or use the associated script:


Installation instructions
------------------------------
This repository contains all the code necessary to run docon. To install it, follow these steps:

* Copy docon/settings-private.py.template to docon/settings-private.py
* Add your database information to settings.py
* Create a virtualenv (preferably, in the project root)
* Install the required packages:

        pip install -r requirements.txt

* Test the environment with:

        python manage.py runserver localhost:<PORT>
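
The first step in the list above could be scripted so that an existing private settings file is never overwritten. This is only a sketch: `create_private_settings` is a hypothetical helper, and the paths are the ones named in the list, relative to the project root.

```python
import shutil
from pathlib import Path


def create_private_settings(project_root):
    """Copy the settings template into place unless the target exists.

    Returns (target_path, created), where created is False when an
    existing settings-private.py was left untouched.
    """
    root = Path(project_root)
    template = root / "docon" / "settings-private.py.template"
    target = root / "docon" / "settings-private.py"
    if target.exists():
        # Keep local edits instead of overwriting them.
        return target, False
    shutil.copyfile(template, target)
    return target, True
```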


If the standalone server works, you can try serving the portal via Apache or nginx and WSGI. It has been tested with Apache2 and uWSGI. In that case, you will also need to serve the static files from your web server. An example configuration for Apache2 would be:

```
<VirtualHost *:80>

[ ... ]

WSGIScriptAlias /docon /path_to_docon/wsgi.py
# nprocesses and nthreads are placeholders: replace them with integers
WSGIDaemonProcess docon user=www-data group=www-data processes=nprocesses threads=nthreads python-path=/path_to_docon:/path_to_docon/venv/lib/python2.7/site-packages
WSGIProcessGroup docon
<Directory /path_to_docon>
    Order allow,deny
    Allow from all
</Directory>

Alias /docon/robots.txt /path_to_docon/static/robots.txt
Alias /docon/favicon.ico /path_to_docon/static/favicon.ico

# The URL path matched by AliasMatch starts with a slash
AliasMatch ^/docon/([^/]*\.css) /path_to_docon/static/styles/$1

Alias /docon/media/ /path_to_docon/media/
Alias /docon/static/ /path_to_docon/static/

<Directory /path_to_docon/static>
    Order deny,allow
    Allow from all
    Options -Indexes
</Directory>

<Directory /path_to_docon/media>
    Order deny,allow
    Allow from all
    Options -Indexes
</Directory>

</VirtualHost>
```

Acknowledgement
---------------
EUROSENTIMENT PROJECT

* Grant Agreement no: 296277
* Starting date: 01/09/2012
* Project duration: 24 months

![Eurosentiment Logo](logo_grande.png)
![FP7 logo](logo_fp7.gif)

