Customers with an on-premise installation of Lilt will need to provision disk space according to their own translation volumes. This article provides guidance on a disk space rules-of-thumb and filetype distributions.
Stored Artifacts
When a file is initially imported into Lilt, Lilt stores the following artifacts:
- Source file
- Intermediate Okapi artifact
- Intermediate extracted XLIFF
- Translated target file
This results in an upper-bound disk space requirement of approximately 4x the total source file size.
That is, customers planning to translate 10 GB of documents using Lilt should provision at least 40 GB of hard drive space.
Typical Filesize Distribution
Customers may expect to translate predominantly text documents (txt, csv, html, xml, json) and desktop publishing formats (word, excel, pdf, etc), without knowing the distribution of files on their system.
A random sample of Lilt data shows that text-based file formats were the most common, but multimedia-heavy formats such as .pptx files used the most disk space: