Rossum’s data extraction billing can be complex, but understanding which documents or pages are included in the billing is essential to managing your costs. Major aspect of our subscription is the volume allowance for document processing.
The fundamental principle of Rossum’s data extraction billing is that we charge for pages processed by our platform. This means that we count each page that you import into Rossum. Additionally, the billing mode is determined by the data capture configuration.
Which Documents or Pages Are Included in the Billing?
A document or page is billed when it is processed by Rossum’s Extraction Engine. This processing happens when the document moves from the importing state to either the review, confirmed, or exported state, based on your automation settings.
We charge for every page of a document, even if some pages do not contain captured data. This billing applies up to the maximum number of pages configured for processing by the Extraction Engine, which defaults to the first 32 pages.
Documents and Pages Included in the Billing:
Newly Uploaded Documents: Documents are billed based on the number of pages processed by the Extraction Engine, up to the queue’s page limit. If a document has 50 pages and the queue limit is 32 pages, only pages 1-32 are billed.
Duplicates: Duplicate documents are processed by the Extraction Engine, and you will be billed for such documents.
Automatically Rejected Documents: Documents automatically rejected based on user-configured rules in Rossum are not billed.
Documents Moved to a Queue with a Higher Page Limit: If a document with 50 pages is initially processed in a queue with a 32-page limit, only the first 32 pages are billed at that time. If the document is later re-extracted in a queue with a higher page limit (e.g., 50 pages), all 50 pages will be billed. In this case, you will be charged for the 50 pages plus the initial 32 pages already billed.
Successfully Re-Extracted Documents: Documents reprocessed by selecting ‘Re-extract’ or moving them to a different queue with the ‘reset’ checkbox ticked will be billed based on the page limit of the new queue.
Which Documents or Pages Are Not Included in the Billing?
While Rossum charges for most documents and pages processed by the Extraction Engine, the following exceptions apply:
Documents in the Failed Import State
Deleted Documents: Documents marked for deletion using the Delete recommendations feature are not billed if you choose to delete them.
Documents Moved Between Tabs (Statuses) Within the Same Queue without Re-extraction: Changing a document’s status, for example from “Postponed” to “To Review,” without re-extracting, does not result in additional charges.
Pages Above the Queue Limit: If a document has 50 pages but the queue limit is 32 pages, only the first 32 pages will be billed. The remaining pages beyond this limit are not billed.
Billing with Document Splitting
For documents uploaded with many pages and then split into several records, either manually by editing documents or by using the document splitting extension, you won’t be billed twice. Here are some examples to explain the billing rules:
Original Document Length | Split Into | Pages Deleted in the Process | Amount of Pages Included in the Billing |
42 pages | 2 docs (20 pages and 22 pages) | 0 | 42 pages |
42 pages | 2 docs (20 pages and 20 pages) | 2 | 40 pages |
Billing with Splitting & Sorting Engine
You are billed for all pages processed by the Splitting & Sorting Engine, up to the page limit set for the queue. Please note that this billing is based on your Splitting & Sorting Engine volume and is separate from your Extraction Engine volume.
Limiting data extraction to first N pages
It is possible to configure the amount of pages included in the initial data extraction. You can specify the limit using the rir_params
attribute of the queue object (using our API). For example, if you want to limit extraction to only the first page, you can set the effective_page_count
parameter to 1.
It’s important to note that the default limit for a new queue is set to 32 pages, so if you don’t change it, Rossum will extract data from up to 32 pages and charge you accordingly. The extraction limit can be set to a number that is lower or higher than 32.
By limiting data extraction to the first N pages, you can save on processing costs and improve efficiency, especially for large documents where you may only need data from the first few pages.
🆘 NEED MORE HELP?If you need help with setting up the limit, please contact us – support@rossum.ai.