Paperless

  @searchingfortao   |     danielquinn   |     danielquinn.org


My Inspiration


Paper Sucks

Paper Loves Being Mysterious

There's No Ctrl+F

Paper Hates Your Spine

It's heavy and takes up space

Paper Wants to Breed

Backups? More Paper!

Paper Hates the Planet

It's 2016

We shouldn't even need to use this stuff anymore

So I made Paperless

What's it Do?

  • Consumes documents from a directory, API, or mailbox
  • OCRs them and puts the text in a database
  • Automatically tags the db record based on user-rules
  • Encrypts and stores the original

It's Just a Wrapper

  • ImageMagick
  • Tesseract
  • GnuPG
  • Python3/Django

How's it Work?

1: The Consumer

(Imports documents)

  • Runs in a loop
  • Reads from a directory
  • Occasionally checking a mailbox

2: The Webserver

(Lets you find stuff)

  • Currently the Django Admin (I know)
  • Provides a DRF API
  • Users are building proper UIs

Demo!

TODO

It works, but it needs polish

  • The UI is the Django admin
  • Mail consumption is really raw
  • Some sort of plugin architecture?
  • OMG Tests

Fork Me

  https://github.com/danielquinn/paperless