Metadata-Version: 2.1
Name: financial-statements-downloader
Version: 0.1.1
Summary: Simple application for downloading financial statements from czech business register.
Home-page: https://github.com/jan-stanek/financial-statements-downloader
Author: Jan Staněk
Author-email: stanej16@fit.cvut.cz
License: MIT
Keywords: czech republic financial statements business register
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Topic :: Utilities
License-File: LICENSE
Requires-Dist: click
Requires-Dist: tika
Requires-Dist: tinydb
Requires-Dist: beautifulsoup4
Requires-Dist: html5lib

Financial statements downloader
===============================

Task
----
Create terminal application for downloading financial statements from czech business register (https://or.justice.cz/)
and extracting information from them and storing them in structured form.

Input is a file with ICOs, one ICO per line.

At first the application downloads pdfs with financial statements. In the next step it extracts information specified in
config file and stores them in structured form.

The application must allow downloading documents of large number of companies, while respecting information system
limits (https://or.justice.cz/ias/ui/podminky).

Installation
------------
* ``pip install financial-statements-downloader``

Commands
--------
* ``import_icos <filepath>`` - imports ICOs from file
* ``download`` - downloads information about subjects and documents
* ``extract`` - extract information from financial statements

Config
------
::

  [downloader]
  base_url = https://or.justice.cz/ias/ui/  # information system url
  search_url = rejstrik-$firma?jenPlatne=VSECHNY&ico=  #search url
  limit_day = 3000  # download limit
  download_extract = true  # download information from extract
  download_documents = true  # download documents
  documents_type = závěrka  # document type
  documents_dir = documents  # directory for downloaded documents

  [parser]  # example of specification extracted information
  vlastni_kapital = Vlastní kapitál
  cizi_zdroje = Cizí zdroje


