Troubleshooting
Common Issues
Deployment Unit Not Found
DeploymentUnitNotFoundException: No deployment unit found
-
Ensure deployment completed successfully
-
Verify model deployment with
cluster unit list -
Check name and version in job parameters and compare it with the deployment unit name and version
-
Ensure you have actual model files (not
.metadataor.lockfiles) -
Check that files are readable and not Git LFS artifacts
Model Not Found
ModelNotFoundException: No model found matching criteria
-
Make sure that the deployment unit exists
-
Make sure that the type parameter in the JobParameters matches the engine that the model has been created on (pytorch, tensorflow, onnx)
-
Verify you have the correct model files for your model type:
-
TensorFlow models:
*.pb -
ONNX models:
*.onnxfile (self-contained, no config needed)
-
Model Initialization Exception
ModelInitializationException:
Files to look for:
The essential files vary by model type and framework:
-
Model weights:
.onnx, or.pb, or*.pt -
Model config:
config.json(except for ONNX - they are self-contained models) -
Model variable directory
model-directory/ ├── saved_model.pb └── variables/ ├── variables.data-00000-of-00001 └── variables.index
Additionally, there could be Files (model-dependent):
-
Tokenizer:
tokenizer.json,tokenizer_config.json(for some models) -
Vocabulary:
vocab.txt,special_tokens_map.json(some NLP models)
Translator Not Found
TranslatorNotFoundException:
-
Make sure that you have used the correct translator for the model
-
Make sure your custom compute job and marshaller is deployed correctly (optional: in case there is user-defined input/output/translator/translatorFactory)
-
Make sure that the input and output classes match the translator that has been used
Memory Issues
OutOfMemoryError during model loading/deploying
-
Increase JVM heap size as per the size of the model that you have deployed
-
Consider model quantization for large models
© 2025 GridGain Systems, Inc. All Rights Reserved. Privacy Policy | Legal Notices. GridGain® is a registered trademark of GridGain Systems, Inc.
Apache, Apache Ignite, the Apache feather and the Apache Ignite logo are either registered trademarks or trademarks of The Apache Software Foundation.