Overview

This page details the measurements used to assess and quantify the resource utilization of LILT applications. These metrics provide insight into the performance, capacity, and scalability requirements of the applications, and include parameters such as CPU utilization, memory consumption, disk size, and concurrent user sessions. Establishing baseline measurements and setting thresholds helps identify normal operating levels and define boundaries for resource usage, which in turn enables proactive capacity planning, resource optimization, and detection of anomalies or performance bottlenecks. The table below is a reference guide rather than an exact match for your environment, because each installation may enable or disable specific services.
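As an illustration of how a baseline and threshold check might look, the sketch below compares an observed value with a recommended minimum and maximum. The function name `classify_usage`, the status labels, and the 80% warning fraction are hypothetical choices made for this example and are not part of LILT.

```python
from typing import Optional


def classify_usage(
    observed: float,
    minimum: Optional[float],
    maximum: Optional[float],
    warn_fraction: float = 0.8,  # assumed warning threshold, not a LILT setting
) -> str:
    """Classify an observed value against a recommended [minimum, maximum] range.

    None stands for "no bound on this side".
    """
    if maximum is not None and observed > maximum:
        return "over-limit"
    if maximum is not None and observed >= warn_fraction * maximum:
        return "near-limit"
    if minimum is not None and observed < minimum:
        return "below-baseline"
    return "normal"


# Illustrative range of 0.5 to 2 vCPUs.
for vcpus in (0.2, 1.2, 1.7, 2.5):
    print(vcpus, classify_usage(vcpus, 0.5, 2.0))
```

The same check applies to memory or disk figures; only the units of the inputs change.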

Usage-Sizing Metrics

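| App Name | Min Recommended vCPUs | Max Recommended vCPUs | Min Recommended Memory (GB) | Max Recommended Memory (GB) | GPU Requirements |
| --- | --- | --- | --- | --- | --- |
| Web Application Services | | | | | |
| Front | 0.5 | 2 | 4 | 8 | |
| Beehive | 0.1 | 4 | 4 | 4 | |
| Core Translation Services | | | | | |
| Core-api | 0.01 | 0.1 | 0.256 | 0.256 | |
| Converter | 1 | 12 | 26 | 28 | |
| QA | 0.5 | 1 | 3 | 4 | |
| Linguist | 0.5 | 1 | 1 | 2 | |
| Search | 0.05 | n/a | 21 | 22 | |
| TM | 4 | 12 | 30 | 32 | |
| TB | 1 | n/a | 30 | 30 | |
| Batch-TB | 2 | 7 | 22 | 30 | |
| Indexer | 0.05 | n/a | 21 | 22 | |
| Lexicon | 4 | 8 | 30 | 30 | |
| Watchdog | 0.1 | n/a | 2 | 3 | |
| Segment | 0.5 | 2 | 4 | 4 | |
| File-translation | 3 | 6 | 14 | 15 | |
| Job | 0.5 | 1 | 2 | 3.5 | |
| Tag | 0.5 | 1 | 2 | 3.5 | |
| Auditlog | 0.5 | 1 | 2 | 3.5 | |
| Assignment | 1 | 2 | 4 | 4 | |
| Workflow | 1 | 2 | 2 | 2 | |
| File-job | 2 | 4 | 16 | 16 | |
| Memory | 1 | 2 | 4 | 4.5 | |
| Notification | 0.5 | n/a | 2 | 4 | |
| Auth | 0.01 | n/a | 0.5 | 1 | |
| Neural/ML Services | | | | | |
| Translate v4 **** | 1 | 2 | 25 | 39 | 2 GPUs |
| Update v4 | 4 | 32 | 80 | 80 | |
| Langid | 0.5 | 3 | 5 | 5 | |
| Update-manager v4 | 0.25 | 0.5 | 1 | 1 | |
| Routing | 0.5 | 3 | 1.5 | 1.5 | |
| Batch v4 | 0.25 | 0.5 | 1 | 1 | |
| Batch-worker-gpu v4 **** | 1 | 2 | 24 | 24 | 1 GPU |
| LLM-inference **** | 1 | n/a | 4 | 30 | 4 GPUs |
| Automqm | 1 | 15 | 10 | 10 | |
| Alignment | 4 | 4 | 8 | 8 | |
| Tag-projection | 4 | 4 | 8 | 8 | |
| NNcache | 3 | 3 | 2 | 2 | |
| VMF (Vector Manipulation) **** | 2 | 2 | 8 | 8 | 1 GPU |
| Connectors Platform | | | | | |
| Connectors-ingressgateway | 0.1 | 2 | 0.128 | 1 | |
| Connectors-builder | n/a | n/a | n/a | n/a | |
| Infrastructure Services | | | | | |
| nginx-ingress | 0.1 | n/a | 0.128 | n/a | |
| istiod | 0.5 | 0.5 | 2 | 2 | |
| istio-ingressgateway | 0.1 | 2 | 0.128 | 1 | |
| Qdrant *** | 4 | n/a | 12 | 12 | |
| Prometheus | n/a | 0.5 | n/a | 2 | |
| Grafana | n/a | 0.5 | n/a | 0.5 | |
| Data Stores | | | | | |
| Redis | 0.2 | 0.6 | 4 | 10 | |
| RabbitMQ | 1 | 2 | 4 | 4 | |
| MySQL | 0.5 | n/a | 4 | 8 | |
| MongoDB | 0.5 | 0.5 | 0.5 | 0.5 | |
| MinIO | 0.1 | 0.5 | 1 | 1.5 | |
| ClickHouse | 0.5 | n/a | 1 | 16 | |
| Elasticsearch *** | 1 | 2 | 4 | 4 | |
| Total* | 58.12 vCPUs | 200.1 vCPUs | 656.384 GB RAM | 784.256 GB RAM | 8 GPUs |
| Total Disk Requirements* | 1 TB minimum | | | | |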
*These specifications are recommended for installing LILT on bare-metal hardware to support the listed applications. They may need to be adjusted to match specific load requirements.
**To handle parallel batch requests efficiently, it is recommended to have as many GPUs as the number of required concurrent batch requests. The system can then distribute the workload and process multiple requests simultaneously, using each GPU to improve throughput.
***These values may be doubled when multiple replicas are configured.
****These pods require GPU nodes for operation.
Services marked n/a have no specified limit in the configuration and can use any available resources on the node. Services marked n/a for both the minimum and the maximum have no explicit resource specifications defined in the Helm charts and will use default values.
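The n/a convention can be made concrete with a short sketch that turns one table row into a Kubernetes-style `resources` mapping. It assumes that minimums correspond to resource requests and maximums to limits, which the note above implies for limits but does not state for requests. The helper names `parse_cell` and `to_resources` are hypothetical, and the output uses decimal units (`m` for millicores, `M` for megabytes).

```python
from typing import Dict, Optional


def parse_cell(cell: str) -> Optional[float]:
    """Turn a table cell into a number; "n/a" becomes None (no value set)."""
    text = cell.strip().lower()
    return None if text == "n/a" else float(text)


def to_resources(
    min_cpu: str, max_cpu: str, min_mem_gb: str, max_mem_gb: str
) -> Dict[str, Dict[str, str]]:
    """Build a Kubernetes-style resources mapping from one table row.

    Assumption: minimums map to requests and maximums to limits.
    Cells marked n/a are left out, so whatever default is configured applies.
    """
    resources: Dict[str, Dict[str, str]] = {"requests": {}, "limits": {}}
    rows = (("requests", min_cpu, min_mem_gb), ("limits", max_cpu, max_mem_gb))
    for section, cpu_cell, mem_cell in rows:
        cpu = parse_cell(cpu_cell)
        mem = parse_cell(mem_cell)
        if cpu is not None:
            resources[section]["cpu"] = f"{round(cpu * 1000)}m"  # vCPUs to millicores
        if mem is not None:
            resources[section]["memory"] = f"{round(mem * 1000)}M"  # GB to MB
    # Drop empty sections so an all-n/a row yields an empty mapping.
    return {name: values for name, values in resources.items() if values}


# MySQL row read as 0.5, n/a, 4, 8: no CPU limit is emitted.
print(to_resources("0.5", "n/a", "4", "8"))
# {'requests': {'cpu': '500m', 'memory': '4000M'}, 'limits': {'memory': '8000M'}}
```

A row with n/a in all four cells, such as Connectors-builder, produces an empty mapping, which matches the case where the charts define no explicit resource specification.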