feat: Configurable liveness/readiness probe settings per pod #60
eliBenven started this conversation in Feature Requests
Summary
Nexlayer applies default liveness/readiness probes that kill a container if it doesn't respond within ~15-30 seconds of starting. Many legitimate workloads need significantly longer to start up, and there is currently no way to configure probe behavior in the launchfile.
Proposed API
Use Cases
1. Game Servers
A Minecraft Paper server needs 60-90s to remap classes and generate world chunks. The Isle and ARK servers can take 2-3 minutes. The current probes kill these containers before startup completes, which leaves them in a CrashLoopBackOff cycle (observed in stress test: Minecraft hit 146 restart attempts).
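To make the timing concrete, here is a rough sketch of the arithmetic, assuming Nexlayer's probes follow Kubernetes-style semantics (the CrashLoopBackOff status suggests this, but it is not confirmed here). The parameter names mirror Kubernetes' `initialDelaySeconds`, `periodSeconds`, and `failureThreshold`; whether the launchfile would expose them under these names is an assumption, and the helper functions are hypothetical.

```python
import math


def startup_budget_seconds(initial_delay: int, period: int, failure_threshold: int) -> int:
    """Approximate time a container has to start before the probe gives up.

    Assumes Kubernetes-style semantics: probing begins after `initial_delay`,
    runs every `period` seconds, and the container is restarted after
    `failure_threshold` consecutive failures. This is an upper-bound estimate,
    not an exact kill time.
    """
    return initial_delay + period * failure_threshold


def min_failure_threshold(startup_seconds: int, period: int, initial_delay: int = 0) -> int:
    """Smallest failure threshold whose budget covers `startup_seconds`."""
    remaining = max(startup_seconds - initial_delay, 0)
    return max(math.ceil(remaining / period), 1)


if __name__ == "__main__":
    # Kubernetes' documented defaults: periodSeconds=10, failureThreshold=3.
    print(startup_budget_seconds(initial_delay=0, period=10, failure_threshold=3))
    # A Paper server needing up to 90s, probed every 10s.
    print(min_failure_threshold(startup_seconds=90, period=10))
```

Under these assumptions, the Kubernetes defaults give a budget of about 30 seconds, which is consistent with the upper end of the ~15-30 second window described in the summary, and a 90-second startup would need a failure threshold of at least 9 at a 10-second period.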
2. Java/JVM Applications
Spring Boot apps with large classpaths, Elasticsearch with index recovery, and Kafka brokers all commonly need 30-120s of startup time.
3. ML Model Loading
Ollama pulling and loading a large model, or any ML inference server loading weights into memory, can take minutes on first boot.
4. Database Recovery
Postgres with WAL replay, Elasticsearch with shard recovery, and MongoDB with journal replay all have startup times that scale with data volume and are hard to predict.
5. Non-HTTP Workloads
Game servers, MQTT brokers, and other TCP/UDP services don't serve HTTP at all, so HTTP-based liveness probes will always fail. These workloads need TCP probes, exec probes, or the ability to disable probes entirely.
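For illustration, a TCP probe amounts to little more than "can a connection be opened on this port within a timeout". The sketch below shows that check in Python. It is not Nexlayer's implementation, and `tcp_probe` is a hypothetical helper name.

```python
import socket


def tcp_probe(host: str, port: int, timeout_seconds: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout.

    No application data is exchanged, so this works for any TCP service
    (an MQTT broker or a game server's TCP port), not just HTTP.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout_seconds):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # 25565 is the default Minecraft Java Edition port; the host is an example.
    print(tcp_probe("127.0.0.1", 25565))
```

A check like this does not cover UDP-only services, which is one reason exec probes or an option to disable probes would still be needed.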
Evidence from Stress Test
References
Migrated from #57