Skip to main content

Library for Insight project

Project description

Lazy prices: an NLP strategy to investing

Investing strategies can be extremely complex. Using advanced mathematical models, investors try relentlessly to "Beat the Street" and gain on edge on each other. One piece of this puzzle are the 10-X filings that listed companies submit at the end of each quarter to report on their operations. They contain a wealth of quantitative information on their financial health.

Unfornately, they have gotten longer and longer over time, which makes reading one a daunting task. From the boilerplate text to incomplete sections, their usefulness has substantially decreased. But has it?

In this project, I attempt to disregard all financial data in the 10-X and build a virtual portfolio of companies based only on text. Let's see how that performs!

Loosely based on:

Cohen, Lauren and Malloy, Christopher J. and Nguyen, Quoc, Lazy Prices (March 7, 2019). 2019 Academic Research Colloquium for Financial Planning and Related Disciplines.

Available at SSRN: https://ssrn.com/abstract=1658471

What is EDGAR (from the SEC's website)?

EDGAR is the Electronic Data Gathering, Analysis, and Retrieval system used at the U.S. Securities and Exchange Commission (SEC). EDGAR is the primary system for submissions by companies and others who are required by law to file information with the SEC.

Containing millions of company and individual filings, EDGAR benefits investors, corporations, and the U.S. economy overall by increasing the efficiency, transparency, and fairness of the securities markets. The system processes about 3,000 filings per day, serves up 3,000 terabytes of data to the public annually, and accommodates 40,000 new filers per year on average.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

secScraper-0.0.39.tar.gz (47.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

secScraper-0.0.39-py3-none-any.whl (67.0 kB view details)

Uploaded Python 3

File details

Details for the file secScraper-0.0.39.tar.gz.

File metadata

  • Download URL: secScraper-0.0.39.tar.gz
  • Upload date:
  • Size: 47.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/2.7.15+

File hashes

Hashes for secScraper-0.0.39.tar.gz
Algorithm Hash digest
SHA256 11fa784761db5e7c653cbaf48f2bb318aa8e475c4a72a52749984554407f9f4b
MD5 e905d9f8f42a02913e436fa1a6f22bee
BLAKE2b-256 7bd69bf630dba684f4c5fbe131e276110a16f7d1fec4a720feaa86a37343f9ef

See more details on using hashes here.

File details

Details for the file secScraper-0.0.39-py3-none-any.whl.

File metadata

  • Download URL: secScraper-0.0.39-py3-none-any.whl
  • Upload date:
  • Size: 67.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/2.7.15+

File hashes

Hashes for secScraper-0.0.39-py3-none-any.whl
Algorithm Hash digest
SHA256 a0855eeb00b92957c04c69354afdf35e07a7fad940c88ad80e9f6127691b0af7
MD5 2ebbb8ff59a06bc0f6c7b6ea045a025d
BLAKE2b-256 4c3cb71afa9de8a495ae691bb0f706f89ab9039be1afdcbc57526e2151dd166d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page