Paperless

Daniel Quinn, London UK, March 2015

  @searchingfortao   |     danielquinn   |     danielquinn.org


Paper Sucks

Paper Loves Being Mysterious

There's No Ctrl+F

Paper Hates Your Spine

It's heavy and takes up space

Paper Wants to Breed

Backups? More Paper!

Paper Hates the Planet

It's 2016

We shouldn't even need to use this stuff anymore

So I made Paperless

What's it Do?

  • Consumes documents from a directory, API, or mailbox
  • OCRs them and puts the text in a database
  • Automatically tags the db record based on user-rules
  • Encrypts and stores the original
  • Generates, encrypts, and stores a thumbnail

It's Just a Wrapper

  • ImageMagick
  • Tesseract
  • GnuPG
  • Python3/Django

How's it Work?

There's two processes

1: The Consumer

(Imports documents)

  • Runs in a loop
  • Reads from a directory
  • Occasionally checking a mailbox

2: The Webserver

(Lets you find stuff)

  • Currently the Django Admin (I know)
  • Provides a DRF API
  • Users are building proper UIs

Demo!

Time to sacrifice a kitten

TODO

It works, but it could use polish

  • The UI is the Django admin
  • Mail consumption is really raw
  • Some sort of plugin architecture?
  • OMG Tests

Fork Me

  https://github.com/danielquinn/paperless