mirror of
https://github.com/paperless-ngx/paperless-ngx.git
synced 2025-04-02 13:45:10 -05:00

commit b1410a854e03087023c89998b14c3296ac669f1f Merge: f9ce4d8f 8ec9c77e Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Thu Dec 29 20:09:09 2022 -0800 Merge pull request #2263 from paperless-ngx/v1.11.0-changelog [Documentation] Add v1.11.0 changelog commit 8ec9c77e51dc492f6b7f468ab533204848a554b3 Author: github-actions <41898282+github-actions[bot]@users.noreply.github.com> Date: Fri Dec 30 04:08:17 2022 +0000 Changelog v1.11.0 - GHA commit f9ce4d8f6a9086d21f7f9c5411a28dd8b0b7135e Author: Michael Shamoon <4887959+shamoon@users.noreply.github.com> Date: Thu Dec 29 19:40:25 2022 -0800 Update version strings for 1.11.0 commit 8c9a74ee0ca03d1f1afd7dee9203648d48bb33c1 Merge: 605f86f0 0b59ef2c Author: Michael Shamoon <4887959+shamoon@users.noreply.github.com> Date: Thu Dec 29 19:39:38 2022 -0800 Merge branch 'dev' commit 605f86f0cfb908761d2f71d7e17c1e60668b7edf Merge: 800e842a 8cbaca22 Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Wed Dec 28 15:55:35 2022 -0800 Merge pull request #2256 from mendelk/patch-1 Fixed typo in docs commit 8cbaca22c12b5f3129b52a376dd56f00600f27be Author: Mendel Kramer <mendelk@users.noreply.github.com> Date: Wed Dec 28 18:16:00 2022 -0500 Fixed typo in docs commit 800e842ab304ce2fcb1c126d491dac0770ad66ff Author: ThellraAK <github.com@absurdlybored.com> Date: Wed Dec 21 01:36:37 2022 -0900 Removing Mariadb default open port (#2227) * Removing Mariadb default open port Removing the listening port 3306 for the DB, Docker networks will let the containers talk to one another. The existing setup would allow anyone to connect to the DB and use the default passwords. * Update docker-compose.mariadb-tika.yml Adding change to the other compose file to remove open port * Remove excess blank lines * Remove excess blank lines Co-authored-by: Felix E <felix@eckhofer.com> commit 6f6f365e2b36410110275ca92b5ba467500bb577 Merge: 6d324dbd 43b863b8 Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Sat Dec 17 19:58:06 2022 -0800 Merge pull request #2203 from tooomm/docs_updates Docs: More fixes and improvements commit 43b863b816337dd19dd9b903e76ecf50b47f1583 Author: tooomm <tooomm@users.noreply.github.com> Date: Sun Dec 11 19:44:18 2022 +0100 doc fixes This reverts commit e015babdc102a65a3cce0cc71812d3eb730da92e. link fix fix escaping, spacing, profile links, typo revert ~~add~~ at fixes Revert "~~add~~ at fixes" This reverts commit ce0192b733c19614048de81ea917660e25bb35f2. commit 6d324dbd8e73c5acdd3b53fd9013c70c53d012e1 Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Fri Dec 16 09:10:11 2022 -0800 Update config.yml commit 8ddf05e573c4bc2a55ef6d20f5e36181ccf534b5 Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Fri Dec 16 09:09:48 2022 -0800 Update bug-report.yml commit 0472dfe25a02b3bc9b148f435bcda6e2e2987355 Author: tooomm <tooomm@users.noreply.github.com> Date: Sun Dec 11 19:12:58 2022 +0100 Docs: Fix leftover issues from conversion (#2172) commit 8b36c9ad64bb7638e33d9cb22217f3d8345d5c1e Author: tooomm <tooomm@users.noreply.github.com> Date: Sun Dec 11 16:07:08 2022 +0100 more fixes and cleanup commit 1266f2d5b948b7d99dab267e34840ece6a3fbaa4 Author: tooomm <tooomm@users.noreply.github.com> Date: Sun Dec 11 12:06:15 2022 +0100 fix links commit 81960519592095df714fb0e0f7a0e907488fa269 Merge: 06a6eb03 d198142a Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Fri Dec 9 16:12:20 2022 -0800 Merge pull request #2157 from Weltraumschaf/patch-1 Update setup.md commit d198142a1ef8cdcaa0d19d126d67b4ade754fceb Author: Sven Strittmatter <ich@weltraumschaf.de> Date: Fri Dec 9 22:09:06 2022 +0100 Update setup.md W/o the slash it resolves to /setup/configuration/ which does 404. commit 06a6eb0326af6eb3bbe523b0c0061fc324578834 Author: Michael Shamoon <4887959+shamoon@users.noreply.github.com> Date: Fri Dec 9 08:15:03 2022 -0800 fix code block indentation commit 28819d6d0fb77b8f6030865b0c0d2a1b74a39cad Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Fri Dec 9 08:11:42 2022 -0800 Fix code block indentation commit 8cd5e25364768512af90c773c6a2d307cf59febe Merge: 32d54674 7788d932 Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Tue Dec 6 11:23:15 2022 -0800 Merge pull request #2137 from paperless-ngx/more-docs-cleanup Chore: Cleanup of new documentation commit 7788d932275fd108f6ab9425b1daeabd2c931422 Author: Trenton Holmes <797416+stumpylog@users.noreply.github.com> Date: Sun Dec 4 08:34:49 2022 -0800 Further cleanup of docs, including fixing autoconvert issues and general cleanups commit 32d546740bd4f086369d1a81ddb6658b2f9298b0 Merge: b0ca57a7 24da3e50 Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Sun Dec 4 19:12:27 2022 -0800 Merge pull request #2118 from alexander-bauer/chart-bump commit 24da3e50342d3494ba93c83a601c8f44c635e43d Author: Alexander Bauer <sasha@linux.com> Date: Mon Dec 5 02:51:35 2022 +0000 Bump Helm Chart version to trigger release commit b0ca57a7f0e5694f5442303e6b17cf6abe120f9a Merge: cdd49c51 c864b3cd Author: shamoon <4887959+shamoon@users.noreply.github.com> Date: Sun Dec 4 14:36:00 2022 -0800 Merge pull request #2114 from paperless-ngx/v1.10.2-changelog [Documentation] Add v1.10.2 changelog commit cdd49c51426e0de8937210a65e717fb46eea6101 Author: Michael Shamoon <4887959+shamoon@users.noreply.github.com> Date: Sun Dec 4 14:32:08 2022 -0800 Update frontend compilation info commit c864b3cd19da3dc37f2f3ba3afa34cfcb73892a8 Author: github-actions <41898282+github-actions[bot]@users.noreply.github.com> Date: Sun Dec 4 21:17:16 2022 +0000 Changelog v1.10.2 - GHA
319 lines
10 KiB
Markdown
319 lines
10 KiB
Markdown
# The REST API
|
|
|
|
Paperless makes use of the [Django REST
|
|
Framework](https://django-rest-framework.org/) standard API interface. It
|
|
provides a browsable API for most of its endpoints, which you can
|
|
inspect at `http://<paperless-host>:<port>/api/`. This also documents
|
|
most of the available filters and ordering fields.
|
|
|
|
The API provides 5 main endpoints:
|
|
|
|
- `/api/documents/`: Full CRUD support, except POSTing new documents.
|
|
See below.
|
|
- `/api/correspondents/`: Full CRUD support.
|
|
- `/api/document_types/`: Full CRUD support.
|
|
- `/api/logs/`: Read-Only.
|
|
- `/api/tags/`: Full CRUD support.
|
|
- `/api/mail_accounts/`: Full CRUD support.
|
|
- `/api/mail_rules/`: Full CRUD support.
|
|
|
|
All of these endpoints except for the logging endpoint allow you to
|
|
fetch, edit and delete individual objects by appending their primary key
|
|
to the path, for example `/api/documents/454/`.
|
|
|
|
The objects served by the document endpoint contain the following
|
|
fields:
|
|
|
|
- `id`: ID of the document. Read-only.
|
|
- `title`: Title of the document.
|
|
- `content`: Plain text content of the document.
|
|
- `tags`: List of IDs of tags assigned to this document, or empty
|
|
list.
|
|
- `document_type`: Document type of this document, or null.
|
|
- `correspondent`: Correspondent of this document or null.
|
|
- `created`: The date time at which this document was created.
|
|
- `created_date`: The date (YYYY-MM-DD) at which this document was
|
|
created. Optional. If also passed with created, this is ignored.
|
|
- `modified`: The date at which this document was last edited in
|
|
paperless. Read-only.
|
|
- `added`: The date at which this document was added to paperless.
|
|
Read-only.
|
|
- `archive_serial_number`: The identifier of this document in a
|
|
physical document archive.
|
|
- `original_file_name`: Verbose filename of the original document.
|
|
Read-only.
|
|
- `archived_file_name`: Verbose filename of the archived document.
|
|
Read-only. Null if no archived document is available.
|
|
|
|
## Downloading documents
|
|
|
|
In addition to that, the document endpoint offers these additional
|
|
actions on individual documents:
|
|
|
|
- `/api/documents/<pk>/download/`: Download the document.
|
|
- `/api/documents/<pk>/preview/`: Display the document inline, without
|
|
downloading it.
|
|
- `/api/documents/<pk>/thumb/`: Download the PNG thumbnail of a
|
|
document.
|
|
|
|
Paperless generates archived PDF/A documents from consumed files and
|
|
stores both the original files as well as the archived files. By
|
|
default, the endpoints for previews and downloads serve the archived
|
|
file, if it is available. Otherwise, the original file is served. Some
|
|
document cannot be archived.
|
|
|
|
The endpoints correctly serve the response header fields
|
|
`Content-Disposition` and `Content-Type` to indicate the filename for
|
|
download and the type of content of the document.
|
|
|
|
In order to download or preview the original document when an archived
|
|
document is available, supply the query parameter `original=true`.
|
|
|
|
!!! tip
|
|
|
|
Paperless used to provide these functionality at `/fetch/<pk>/preview`,
|
|
`/fetch/<pk>/thumb` and `/fetch/<pk>/doc`. Redirects to the new URLs are
|
|
in place. However, if you use these old URLs to access documents, you
|
|
should update your app or script to use the new URLs.
|
|
|
|
## Getting document metadata
|
|
|
|
The api also has an endpoint to retrieve read-only metadata about
|
|
specific documents. this information is not served along with the
|
|
document objects, since it requires reading files and would therefore
|
|
slow down document lists considerably.
|
|
|
|
Access the metadata of a document with an ID `id` at
|
|
`/api/documents/<id>/metadata/`.
|
|
|
|
The endpoint reports the following data:
|
|
|
|
- `original_checksum`: MD5 checksum of the original document.
|
|
- `original_size`: Size of the original document, in bytes.
|
|
- `original_mime_type`: Mime type of the original document.
|
|
- `media_filename`: Current filename of the document, under which it
|
|
is stored inside the media directory.
|
|
- `has_archive_version`: True, if this document is archived, false
|
|
otherwise.
|
|
- `original_metadata`: A list of metadata associated with the original
|
|
document. See below.
|
|
- `archive_checksum`: MD5 checksum of the archived document, or null.
|
|
- `archive_size`: Size of the archived document in bytes, or null.
|
|
- `archive_metadata`: Metadata associated with the archived document,
|
|
or null. See below.
|
|
|
|
File metadata is reported as a list of objects in the following form:
|
|
|
|
```json
|
|
[
|
|
{
|
|
"namespace": "http://ns.adobe.com/pdf/1.3/",
|
|
"prefix": "pdf",
|
|
"key": "Producer",
|
|
"value": "SparklePDF, Fancy edition"
|
|
}
|
|
]
|
|
```
|
|
|
|
`namespace` and `prefix` can be null. The actual metadata reported
|
|
depends on the file type and the metadata available in that specific
|
|
document. Paperless only reports PDF metadata at this point.
|
|
|
|
## Authorization
|
|
|
|
The REST api provides three different forms of authentication.
|
|
|
|
1. Basic authentication
|
|
|
|
Authorize by providing a HTTP header in the form
|
|
|
|
```
|
|
Authorization: Basic <credentials>
|
|
```
|
|
|
|
where `credentials` is a base64-encoded string of
|
|
`<username>:<password>`
|
|
|
|
2. Session authentication
|
|
|
|
When you're logged into paperless in your browser, you're
|
|
automatically logged into the API as well and don't need to provide
|
|
any authorization headers.
|
|
|
|
3. Token authentication
|
|
|
|
Paperless also offers an endpoint to acquire authentication tokens.
|
|
|
|
POST a username and password as a form or json string to
|
|
`/api/token/` and paperless will respond with a token, if the login
|
|
data is correct. This token can be used to authenticate other
|
|
requests with the following HTTP header:
|
|
|
|
```
|
|
Authorization: Token <token>
|
|
```
|
|
|
|
Tokens can be managed and revoked in the paperless admin.
|
|
|
|
## Searching for documents
|
|
|
|
Full text searching is available on the `/api/documents/` endpoint. Two
|
|
specific query parameters cause the API to return full text search
|
|
results:
|
|
|
|
- `/api/documents/?query=your%20search%20query`: Search for a document
|
|
using a full text query. For details on the syntax, see [Basic Usage - Searching](/usage#basic-usage_searching).
|
|
- `/api/documents/?more_like=1234`: Search for documents similar to
|
|
the document with id 1234.
|
|
|
|
Pagination works exactly the same as it does for normal requests on this
|
|
endpoint.
|
|
|
|
Certain limitations apply to full text queries:
|
|
|
|
- Results are always sorted by search score. The results matching the
|
|
query best will show up first.
|
|
- Only a small subset of filtering parameters are supported.
|
|
|
|
Furthermore, each returned document has an additional `__search_hit__`
|
|
attribute with various information about the search results:
|
|
|
|
```
|
|
{
|
|
"count": 31,
|
|
"next": "http://localhost:8000/api/documents/?page=2&query=test",
|
|
"previous": null,
|
|
"results": [
|
|
|
|
...
|
|
|
|
{
|
|
"id": 123,
|
|
"title": "title",
|
|
"content": "content",
|
|
|
|
...
|
|
|
|
"__search_hit__": {
|
|
"score": 0.343,
|
|
"highlights": "text <span class="match">Test</span> text",
|
|
"rank": 23
|
|
}
|
|
},
|
|
|
|
...
|
|
|
|
]
|
|
}
|
|
```
|
|
|
|
- `score` is an indication how well this document matches the query
|
|
relative to the other search results.
|
|
- `highlights` is an excerpt from the document content and highlights
|
|
the search terms with `<span>` tags as shown above.
|
|
- `rank` is the index of the search results. The first result will
|
|
have rank 0.
|
|
|
|
### `/api/search/autocomplete/`
|
|
|
|
Get auto completions for a partial search term.
|
|
|
|
Query parameters:
|
|
|
|
- `term`: The incomplete term.
|
|
- `limit`: Amount of results. Defaults to 10.
|
|
|
|
Results returned by the endpoint are ordered by importance of the term
|
|
in the document index. The first result is the term that has the highest
|
|
[Tf/Idf](https://en.wikipedia.org/wiki/Tf%E2%80%93idf) score in the index.
|
|
|
|
```json
|
|
["term1", "term3", "term6", "term4"]
|
|
```
|
|
|
|
## POSTing documents {#file-uploads}
|
|
|
|
The API provides a special endpoint for file uploads:
|
|
|
|
`/api/documents/post_document/`
|
|
|
|
POST a multipart form to this endpoint, where the form field `document`
|
|
contains the document that you want to upload to paperless. The filename
|
|
is sanitized and then used to store the document in a temporary
|
|
directory, and the consumer will be instructed to consume the document
|
|
from there.
|
|
|
|
The endpoint supports the following optional form fields:
|
|
|
|
- `title`: Specify a title that the consumer should use for the
|
|
document.
|
|
- `created`: Specify a DateTime where the document was created (e.g.
|
|
"2016-04-19" or "2016-04-19 06:15:00+02:00").
|
|
- `correspondent`: Specify the ID of a correspondent that the consumer
|
|
should use for the document.
|
|
- `document_type`: Similar to correspondent.
|
|
- `tags`: Similar to correspondent. Specify this multiple times to
|
|
have multiple tags added to the document.
|
|
|
|
The endpoint will immediately return "OK" if the document consumption
|
|
process was started successfully. No additional status information about
|
|
the consumption process itself is available, since that happens in a
|
|
different process.
|
|
|
|
## API Versioning
|
|
|
|
The REST API is versioned since Paperless-ngx 1.3.0.
|
|
|
|
- Versioning ensures that changes to the API don't break older
|
|
clients.
|
|
- Clients specify the specific version of the API they wish to use
|
|
with every request and Paperless will handle the request using the
|
|
specified API version.
|
|
- Even if the underlying data model changes, older API versions will
|
|
always serve compatible data.
|
|
- If no version is specified, Paperless will serve version 1 to ensure
|
|
compatibility with older clients that do not request a specific API
|
|
version.
|
|
|
|
API versions are specified by submitting an additional HTTP `Accept`
|
|
header with every request:
|
|
|
|
```
|
|
Accept: application/json; version=6
|
|
```
|
|
|
|
If an invalid version is specified, Paperless 1.3.0 will respond with
|
|
"406 Not Acceptable" and an error message in the body. Earlier
|
|
versions of Paperless will serve API version 1 regardless of whether a
|
|
version is specified via the `Accept` header.
|
|
|
|
If a client wishes to verify whether it is compatible with any given
|
|
server, the following procedure should be performed:
|
|
|
|
1. Perform an _authenticated_ request against any API endpoint. If the
|
|
server is on version 1.3.0 or newer, the server will add two custom
|
|
headers to the response:
|
|
|
|
```
|
|
X-Api-Version: 2
|
|
X-Version: 1.3.0
|
|
```
|
|
|
|
2. Determine whether the client is compatible with this server based on
|
|
the presence/absence of these headers and their values if present.
|
|
|
|
### API Changelog
|
|
|
|
#### Version 1
|
|
|
|
Initial API version.
|
|
|
|
#### Version 2
|
|
|
|
- Added field `Tag.color`. This read/write string field contains a hex
|
|
color such as `#a6cee3`.
|
|
- Added read-only field `Tag.text_color`. This field contains the text
|
|
color to use for a specific tag, which is either black or white
|
|
depending on the brightness of `Tag.color`.
|
|
- Removed field `Tag.colour`.
|